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AAAAATTGAG 
TGGATATTGA 
AAGTATAGCA 
TGGCTCGAAA 
AGTTCTACTT 
TTAGTTTTAA 
GTTTAAATAA 
TTAAGGAGCA 
GGATTATTTC 
TGAGTCAAAG 
AAAACGCAAG 
TTACTATTCG 
CTCCCGAGTC 
CAGTTCTGAA 
AATATCTTAT 
CATGCAAGTT 
TGAATTTATA 
CTGATTTAGC 
TTATATCTTT 
ATTTCAAAAT 
TAGCAACGAA 
CAAAAATTCC 
TTTTTCATTT 
TTAGCTGGAA 
CTTAGCTTAT 
TAAGGAAACA 
TTTTAGAGTC 
GATATTGCAA 
ATCTTGATTG 
AAGAAATAAA 
TAGAGATCAG 
ACAATAT^AAA 
CCAGTTAGCA 
GAAGAAATAA 
TTGGGGTTTT 



FIG. 11 



+ 



+ 



7/78 





FIG. 8 



+ 



+ 



8/78 





1 


AAAACCCCAA 


AACCCCAAAA 




51 


AACCTCAGTA 


TTAATAAGCT 




101 


ATGGAGGTTG 


ATGTTGATAA 




151 


TCTTAAGACT 


TGTGAAGAAA 




201 


TCCAGAAAGT 


TATTAGATGA 




251 


TTAGAAGATA 


TTAAAATATT 




301 


AGACTATAAT 


GAAGAAGATT 




351 


CAACTGGACT 


AATGATCGAA 




401 


TCATCAAGCG 


ATGTTTCAGA 




451 


ACTTAAGGGA 


AATCAATTAG 




501 


CTCAAAAGCA 


GTATTTCTTT 




551 


ATTGGAAATG 


AGCTCTTCCG 




601 


GCGAACTTCT 


GAAGGAACTC 




651 


ATCATTTGAA 


AGTCAACGAT 




701 


GCAGACATGA 


ATGAACCTCG 




751 


GAATGAGAAA 


GATCACTTTC 




801 


ATATGAAATC 


AAGAACCAGA 


IP 


851 


AACCAATTCT 


TCAAAAAGCA 




901 


AGCGATGGAC 


AGAGCTCAGA 




951 


GAATTAGAAA 


GAAGCTAAAA 


ill 

I xsr 


1001 


CTTGAGAAAG 


TCAAAGATTT 




1051 


TCCTCTTCCA 


GAAAATTGGC 


1101 


TAAATAAAAC 


TAGAGAAGAA 


•a? ; 


1151 


TACACAACTG 


ATAATAAATG 


55; 


1201 


CAATATACTC 


CCCAAAGACT 




1251 


AAAAGAAAGT 


TAAGAAATAT 




1301 


AAAAACTTAT 


TGCTTGAGAA 


i.'s 


1351 


GGTTGAGACC 


TCTGCAAAGC 




1401 


ACGTCTTATG 


GAAATTGCTC 


i , 


1451 


CTGATTAGAT 


GATTTTTCTA 


b 

! "St? 


1501 


AACCTATTAC 


TACAGAAAGA 


1551 


TCGCAGACTT 


AAAGAAGGAA 


1601 


GAAGAATGGA 


AAAAGTCGCT 




1651 


ACCGAAGAAA 


ACTACTTTCC 




1701 


TAAATTCAGA 


CCGGAAGACT 




1751 


AACTCTCACT 


TAATGCTTAA 




1801 


TTTTGGATTC 


GCTGTTTTTA 




1851 


AGTTTGTTTG 


CAAATGGAAG 




1901 


ACTATGGATA 


TCGAAAAGTG 




1951 


AACATTCCTA 


AAAACTACTA 




2001 


CTGCACAAAT 


TCTAAAGAGA 




2051 


TTTAGAAAGA 


AAGAAATGAA 




2101 


TGCACTTGAA 


GGAGGACAAT 




2151 


AACAAAATGA 


CTTAAATGCA 




2201 


AGAAATTATT 


TTAAGAAAGA 




2251 


CCAATATAAT 


TACATTAACT 




2301 


GAATTCCTCA 


AGGTCTTTGA 




2351 


GCAACATTAG 


AGGAAAGCTC 



CCCCTTTTAG 
CAGATTTTAA 
TCAAGCTGAT 
TTAAAGAAGC 
AGAAATCAAT 
TGCGCAGACA 
TTAAAGTTAT 
CTTATTGACA 
TAGACAAAAA 
CAAAGACCCA 
CAAGACGAAT 
ACATCTCTAC 
TTGTTCAATT 
AAGTTTGACA 
ATGTTGATCA 
TCAACAACAT 
ATATTTTATT 
TGAGTTTGTG 
CGATATTCAC 
GATAAGGTTA 
TAACTTCAAC 
GGGAACGGAA 
AAGTCGAAGT 
CGTCACACAA 
TTTTGACTGG 
GTGGAACTAA 
GATCAATACA 
ATTTTTATTA 
CGATGGATAT 
TGTCACCGAG 
ATATTTGGGA 
ACGCTTGCTG 
TGGATTTGCA 
GTCCAATTAT 
ACAAAATTAA 
GACATTGAAG 
ACTATGATGA 
CAAGTTGGAC 
ATATGATAGT 
AATTACTTTC 
AAGAATAACA 
AGATTATTTT 
ATCCAACCTT 
AAGAAAACAT 
TAACTTACTT 
TTAATGGGAA 
GTTTCATCAA 
CTTAGGATTC 
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CTTCTTTGCA 
AAAAACTATC 
TGGATTATGA 
TTCGAAAAAC 
TCCAGAAGAT 
CTTGAAAATG 
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TTAATATTTG 
CAAACAAAAG 
ATTTTATTAT 
AATCAATGAA 
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GACTTTGCGA 
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TGGAGAGGAG 
TTGAAATATT 
CTCAAGGCAA 
TCAATATGAT 
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CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 

1 + + + + + + 60 

GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCATCAAATCT 

PQNPKTPKPL*KKKKLR*FR 
PKTPKPQNPYKKRKN*GSLE- 
PKPQNPKTPIKKEKIEVV*K- 

AATAAAATATTATTCCCGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT 

1 + + + + + + 120 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAACCTACTATATCTTTTAA 

NKILFPHKWRWILIWMI*KI 
IKYYSRTNGDGY * FG* YRKF- 
♦NIIPAQMEMDIDLDDIENL- 

TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 

1 + + - + + + + 180 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 

YFLIHSTSIAALVVTRKDAK 
TS*YIQQV*QLL* *QERMQN- 
LPNTFNKYSSSCSDKKGCKT- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 

1 + + + + + + 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

H*NLARNRLH*LFQSCKNN* 
lEIWLEIAFIDYSKVAKTIR- 
LKSGSKSPSLTI PKLQKQLE- 

AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 

1 + + + + + + 300 

TCAAGATGAAGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAATCAAAATT 

SSTSRMQIFITILS*EN*F* 
VLLLGCKSL*RFFLEKISFK- 
FYFSDANLYNDSFLRKLVLK- 

AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 

1 + + + + + + 360 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT 

KAESKE*KLKHY*CLNKIR* 
KRRAKSRN*NITNV* IKSGN- 
SGEQRVEIETLLMFK*NQVM- 

TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 

1 + + ^ ^ + + 420 

ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 

*GLFYFLDHFLRSIMEKIT* 
EDYSIF*ITS*GALWRKLLN- 
RI ILFFRSLLKEHYGENYLI- 
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TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAAC7VATGATGAGTATATTAAATT 

421 + + + + + + 480 

ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTGTTACTACTCATATAATTTAA 

a Y*KVNSLDYFPSQQ**VY*I 

b TKR*TVWIISLANNDEYIKF- 

C LKGKQFGLFP*PTMMSILNS- 

CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 

481 + + + + + + 540 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 

a HMRMSQRISIHQTYQRQTRY 

b I*E*VKGSRYIRLTKDKLAI- 

C YENESKDLDTSDLPKTNSL*- 

AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 

541 + + + + + + 600 

I . TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 

Q a KTQEKV**SNSRRTYCIYYS 

tH b KRKKKFDNRTAEELIAFTIR- 

C NARKSLIIEQQKNLLHLLFV- 

y s 

:£! TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 

l4l 601 + + + + + + 660 

!: ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

i-.n 

iJl a YGFYYNCFRYRR*TPES*DN 

b MGFITIVLGIDGELPSLETI- 
C WVLLQLF*VSTVNSRVLRQL- 



TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 

'JT 661 - + + + + + + 720 

^ ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 

;==i a *KSCLQLKESQF*KF*CVCH 

b EKAVYN*RNRSSESSDVYAI- 
ly c KKLFTTEGIAVLKVLMCMPL- 

TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

721 + + + + + + 780 

ATAAT^CACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 

a YFVN*SQISYLNLMDSYRNK 

b II.*INLKYLISI*WIAIETN- 

C FCELISNILSQFNG*L*KQT- 

CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 

781 + + + + + 840 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 

a PNKPCKFNGIYVKSFGTNAH 

b QINHASLMEYTLNPLGQMHT- 

C K*TMQV*WNIR*ILWDKCTL- 

TGi\ATTTATATTGGATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC 

841 + + + + + + 900 

ACTTAAATATAACCTAAGAATTTCGTATCTATGTGTCTTACGAAATCTCTGACTAAATCG 

a *IYIGFLKHRYTECFRD*FS 

b EFILDS*SIDTQNALETDLA- 

C N LYWILKA*IHRML*RLI*L- 
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TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

901 + + + + + + 960 

AATGTTGTCTAATGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 

a LQQITCFDYSCSSLISLKEA 

b YNRLPVLITLAHLLYL*KKQ- 

C TTDYLF*LLLLISYIFKRSR- 

GGCGAAATGAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 

961 + + + +- - + + 1020 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

a GEMKRRLKKEISKFVDSSVT 

b AK*KED*RKRFQNLLILL*P- 

C RNEKKTKERDFKIC*FFCNR- 

GGAATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 

1021 + + + + + + 1080 

CCTTAATTGTTGTTCTTATAATCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a GINNKNISNEKEEELSQS*F 

b ELTTRILATKKKKSYHNPDS- 

C N*QQEY*QRKRRRAITILIL- 

TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 

1081 + + + + -f + 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

a LKISKIPGKRDTFIKIHIL* 

b *RFQKFQVREIHSLKFIYYS- 

c KDFKNSR*ERYIH*NSYIIV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

1141 + + + + + + 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

a FFISQLLFSFILTIFFD*LE 

b FSFHSCYFLLS*QYFLISWK- 

C FHFTAVIFFYLNNIF*LAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

1201 + + + + + + 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

a VKSIK*EKR*TEVT*LIHIH 

b *KVSNKRSARLR*LSLFTFI- 

C KKYQIREALD*GNLAYSHS*- 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

1261 + + + + + + 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 
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MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 
VNVTAACLLQEGSYYQDKDERRYIITKALLEVAESDPEFICQLA 
VY I RNEL Y I RTTTNY I VAFC WHKNTQ P F I E KY FNKAVLL PNDL 
LEVCEFAQVLYIFDATEFKNLYLDRILSQDIRKELTFRKCLQRC 
VRSKFSEFNEYQLGKYCTESQRKKTMFRYLSVTNKQKWDQTKKK 
RKENLLTKLQAIKESEDKSKRETGDIMNVEDAIKALKPAVMKKI 
AKRQNAMKKHMKAPKIPNSTLESKYLTFKDLIKFCHISEPKERV 
YKILGKKYPKTEEEYKAAFGDSASAPFNPELAGKRMKIEISKTW 
ENELSAKGNTAEVWDNLISSNQLPYMAMLRNLSNILKAGVSDTT 
HSIVINKICEPKAVENSKMFPLQFFSAIEAVNEAVTKGFKAKKR 
ENMNLKGQIEAVKEWEKTDEEKKDMELEQTEEGEFVKVNEGIG 
KQY INS I ELAI KI AVNKNLDE I KGHTAI FS DVSGSMSTSMSGGA 
KKYGSVRTCLECALVLGLMVKQRCEKSSFYIFSSPSSQCNKCYL 
EVDLPGDELRPSMQKLLQEKGKLGGGTDFPYECIDEWTKNKTHV 
DNIVILSDMMIAEGYSDINVRGSSIVNSIKKYKDEVNPNIKIFA 
VDLEGYGKCLNLGDEFNENNYI KI FGMSDS I LKFI SAKQGGANM 
VEVIKNFALQKIGQK 

FIG. 20 

MSRRNQKKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIK 
EEDLKLLKFKNQDQDGNSGNDDDDEENNSNKQQELLRRVNQIKQ 
QVQLIKKVGSKVEKDLNLNEDENKKNGLSEQQVKEEQLRTITEE 
QVKYQNLVFNMDYQLDLNESGGHRRHRRETDYDTEKWFEISHDQ 
KNYVS I YANQKTS YCWWLKDYFNICNNYDHLNVS INRLETEAEFY 
AFDDFSQT I KLTNNS YQTVNI D VNFDNNLC I LALLRFLLS LERF 
NILNIRSSYTRNQYNFEKIGELLETIFAWFSHRHLQGIHLQVP 
CEAFQYLVNSSSQISVKDSQLQVYSFSTDLKLVDTNKVQDYFKF 
LQEFPRLTHVSQQAI PVS ATNAVENLNVLLKKVKHANLNLVS I P 
TQFNFDFYFVNLQHLKLEFGLEPNILTKQKLENLLLSIKQSKNL 
KFLRLNFYTYVAQETSRKQILKQATTIKNLKNNKNQEETPETKD 
ETPSESTSGMKFFDHLSELTELEDFSVNLQATQEIYDSLHKLLI 
RSTNLKKFKLSYKYEMEKSKMDTFIDLKNIYETLNNLKRCSVNI 
SNPHGNISYELTNKDSTFYKFKLTLNQELQHAKYTFKQNEFQFN 
NVKSAKIESSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPN 
NIQKNPFNKPNLLFFKQFEQLKNLENVSINCILDQHILNSISEF 
LEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQIiPELNQVYINQ 
QLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDFD 
QNTVSDDSIKKILESISESKYHHYLRLNPSQSSSLIKSENEEIQ 
ELLKACDEKGVLVKAYYKFPLCLPTGTYYDYNSDRW 

FIG. 22 



MKILFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 
PNSRKIALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKI 
ARNEDVNNSLFCHSANVNVTLLKGAAWKMFHSLVGTYAFVDLLI 
NYTVIQFNGQFFTQIVGNRCNEPHLPPKWVQRSSSSSATAAQIK 
QLTEPVTNKQFLHKLNINSSSFFPYSKILPSSSSIKKLTDLREA 
I FPTNLVKI PQRLKVRINLTLQKLLKRHKRLNYVS ILNS ICPPL 
EGTVLDLSHLSRQSPKERVLKFIIVILQKLLPQEMFGSKKNKGK 
IIKNLNLLLSLPLNGYLPFDSLLKKLRLKDFRWLFISDIWFTKH 
NFENLNQLAI CFIS WLFRQLI PKI I QTFFYCTE I S STVTI VYFR 
HDTWNKLITPFIVEYFKTYLVENNVCRNHNSYTLSNFNHSKMRI 
I PKKSNNEFRI I AI PCRGADEEE FTI YKENHKNAI QPTQKI LEY 
LRNKRPTSFTKIYSPTQIADRIKEFKQRLLKKFNNVLPELYFMK 
FDVKSCYDSIPRMECMRILKDALKNENGFFVRSQYFFNTNTGVL 
KLFNWNAS RVP KP YEL Y I DNVRTVHLSNQD VI NWEME I FKTA 
LWVEDKCYIREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS 
QDTLILKLADDFLIISTDQQQVINIKKLAMGGFQKYNAKANRDK 
ILAVSSQSDDDTVIQFCAMHIFVKELEVWKHSSTMNNFHIRSKS 
SKGIFRSLIALFNTRISYKTIDTNLNSTNTVLMQIDHWKNISE 
CYKSAFKDLSINVTQNMQFHSFLQRIIEMTVSGCPITKCDPLIE 
YEVRFTILNGFLESLSSNTSKFKDNIILLRKEIQHLQAYIYIYI 

F/G. 23 
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tgaatatttc 
atatatttta 
aaaaaatcg 



taattaataa 
ccataggctc 
tacaaaagcc 
aagcttttaa 
gaagaaaaca 
caagtttaat 
gatgaaaaca 
attactgaag 
ttaaatgaga 
tggtttgaaa 
tcatattgtt 
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ctctgtatac 
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tctgagtcta 
aaatctgaaa 
gtaaaagcat 
tcagatagat 
tttgcttatt 
gttatttaat 



ataaaaaaaa 
ctataggcaa 
agattgagca 
agttcaaaaa 
actcaaataa 
tgataaaaaa 
aaaagaatgg 
aataggttaa 
gtggtggcca 
tatctcatga 
ggtggcttaa 
gactagaaac 
ctaataattc 
tcgcattgct 
cttatacaag 
cagttgtctt 
tctaatattt 
actctttctc 
tcttataaga 
ctaacgctgt 
tagtttctat 
aattagagtt 
tgagtataaa 
cttaagaaac 
acaataaaaa 
gtggtatgaa 
acttgtaagc 
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aatttaagct 
aattttaatt 
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aagcattcat 
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aattgactgt 
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actataaatt 
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atttgaataa 
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gcaaactaca 
tgaaacaaat 
ttataagacc 
ttaagattag 
ataataagaa 
agttggttct 
actttctgaa 
gtattaaaat 
tagaagacac 
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agattatttt 
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attttttgat 
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gttcaaatta 
taagaatatt 
tcatggaaac 
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taataacgtt 
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caagcaattt 
gcatatactt 
tttgaaaaga 
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gagtgaagta 
atgtgagttt 
tgtaagtgat 
ttatttgaga 
ttaagaactt 
ccctctatgt 
taaatattag 
tacatacaat 
aagtaaataa 



aagaaaatgt 
cttgattttg 
tagtagtaat 
gatggaaact 
ttattaagga 
aaggtagaga 
tagcaagtga 
ttagtattta 
agaagagaaa 
tatgtatcaa 
aataaaaaca 
ttctatgcct 
gttaacatag 
ttatcactag 
aattttgaga 
cacttacaag 
tcatcataaa 
aaattagttg 
ttgactcatg 
aatgttttac 
ttcaattttg 
ccaaatattt 
aatcttaaat 
cagatattaa 
actcctgaaa 
catctttctg 
atttatgata 
agttacaaat 
tatgaaacct 
atttcttatg 
taagaattat 
aaaagtgcaa 
aaatctattg 
cccaacaata 
gaataattga 
aattctattt 
tattatttat 
cctgaattaa 
cataagtaag 
atcaaagaat 
gactctatta 
ttgaacccta 
ctcaaagctt 
ttaccaactg 
tttaaataaa 
agtcattttt 
ttatttttca 



caaggcgtaa 
tattacaaaa 
agatcaaaga 
ctggcaacga 
gagtcaatta 
aagatttgaa 
aagaagagta 
acatggacta 
cagattatga 
tttacgccaa 
attatgatca 
ttgatgattt 
acgttaattt 
aaagattcaa 
aaattggtga 
gcattcattt 
ttagcgttaa 
acactaacaa 
taagctagta 
ttaaaaaggt 
atttctactt 
tgacaaaaca 
ttttaagatt 
aacaagctac 
ctaaagatga 
aattaaccga 
gcttgcacaa 
atgaaatgga 
taaacaatct 
aactgacaaa 
aacacgctaa 
aaattgaatc 
cttcttgtaa 
tttagaaaaa 
aaaatttgga 
cagaattctt 
tacaatatta 
attaagttta 
tatgggaaaa 
catcctaaac 
aaaagatttt 
gttaatctag 
gcgacgaaaa 
gtacttatta 
tattaaatat 
agtgttttga 
atcatttttt 



FIG. 21 
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Oxytricha LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 

FIG. 24 



ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 
ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 
GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 
TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 
GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 
ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 
TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 
GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 
TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 
GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 
ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 
AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 
AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 
GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 
GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 
ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 
TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 
CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 
T^TGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 
AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 
CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 
TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 
TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 
CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 
TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 
TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 
TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 
GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 
TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 
CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 
AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 
TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 
TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 
CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 
AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 
CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 
TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 
CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 
TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCT^AACACATCAAA 
ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 



FIG. 26 
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AKFLHWLMSVYWELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR 
VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYWGARTFRREK 
RAERLTSRVKALFSVLNYERA 



FIG. 27 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 
TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 
GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAG 
AGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGC 
CAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACGGGC 
TGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGA 
GAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCT 
CAACTACGAGCGGGCGCG 



FIG, 28 



MTEHHTPKSRILRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTWGF 
DSKPDEGVQFSSPKCSQSELIANWKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLVSTF 
PNYLISILESKNWQLLLEIIGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSKKRKR 
TIETSITQNKSARKEVSWNSISISRFSIFYRSSYKKFKQDLYFNLHSICDRNTVHMWLQWIFPRQFG 
LINAFQVKQLHKVIPLVSQSTWPKRLLKVYPLIEQTAICRLHRISLSKVYNHYCPYIDTHDDEKILS 
YSLKPNQVFAFLRSILVRVFPKLIWGNQRIFEIILKDLETFLKLSRYESFSLHYLMSNIKISEIEWL 
VLGKRSNAKMCLSDFEKRKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIWKLLCR 
PFITSMKMEAFEKINENNVRMDTQKTTLPPAVIRLLPKKNTFRLITNLRKRFLIKMGSNKKMLVSTN 
QTLRPVASILKHLINEESSGIPFNLEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLM 
FRIVKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKWQLLSMKTSDTLFVDFVDY 
WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 
LRWDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENSNGIINNTFFNESKKRMPFFG 
FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCN 
lYRLGYSMCMRAQAYLKRMKDIFIPQRMFITDLLNVIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC 
LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRIAD 



FIG. 29 
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Wild Type Telomerase Gene 
Wild Type Telomerase Gene 



1. Transform with linear fragment 
containing the telomerase gene 
disrupting with a LEU2 or ura4 marker 



M 



Telomerase Gene::M 



Wild Type Telomerase Gene 




Wild Type Telomerase Gene 



Wild Type (X2) 



2. Assay in selective media 



3. Sporulate, and grow on 
selective media 




Senescence? (X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 



FIG. 43 



48/78 



m 



l7Bjn: 



t^BJn: 
t7ejn::|.zay^ 



ill 




t^Bjn:: 1,291/^1 



S 

X 



eg 



s 

X 



o 



c 

CD 

E 

<0 



J3 

"cm" 



+ 



CO 
CO 



CM t 

f 

CvJ 



+ 



in 
oo 
QQ 




+ 



49/78 



lU- 
IJl 



heterozygote diploid 
wt diploid 
wt haploid 



Tetrad 



C C <D <D 



O) D> O) Q 

in O to O CM 

CM in 1^ T- 

< 1 I I I 



Generations 
after sporulation 



Telonneric 
Apa I 
fragment 




600 bp 
500 bp 
400 bp 

300 bp 
200 bp 



I 100 bp 



FIG. 45 



+ 



50/78 



























CO 




00 




CO 




00 




03 




(N 




in 










o 


O 


O 


o 


o 


o 


o 


o 


o 


o 


GO 


iH 








n 












ro 


VO 


O 


m 


VO 


00 


o 


VD 






o 


CO 


vr> 




CM 


o 


CD 


in 


o 


O 


o 


o 


iH 


O 


•H 


O 


OJ 


VO 


ro 


o 




rH 




CM 


00 






ro 






ir> 


vo 


r- 


GO 


oo 


a\ 


rH 


CM 






iH 


V£> 


rH 


00 


iH 


00 


H 


rH 


rH 


rH 


rH 


rH 



i 5^ 




OOVO^OJOOOVD^OJOOO 

rH(Nro^^Lnvor^oooo 



u 

01 
Dl 

cn 

4-) 

nj 

4-) 

rd 

4J 

(IJ 
(0 
4-» 
U 

01 

m 

01 
01 
01 

u 
*j 

(0 

*j 

u 

4-> 
4J 

^ 

4-> 
4-) 

nj 

o> 

4J 
4J 
4-» 
4J 
01 
4J 
4J 
4J 
4J 
4J 



rd 

4-> 

rd 
4J 
Ol 

Eh CO 

u o 

u 

H CO 

CJ 
CD 

H O 



a> rH 

cr» 00 



CD 

a 
< 

CD 
O 

CJ pti 

CD pa 

H 
Eh 

Eh 
O 

< CO 

CD Q 

CJ 
H 
Eh 

< 2 



Eh 



Eh 

^2 



o o 

CD > 

< 
CD 

CD CD 
O 



CD > 

O EC 
CD 

CJ 

a 

CD < 



6 

u 

Eh 

Eh 

H I 



CD Q 

CD W 
Eh 

^» 

cn 

fU 

n3 
(d 

4J 
(0 

Oi 
u 
u 
<d 
fd 

4-) 
4J 
4J 

cd 

4J 

CJ1 

■u 

4-» 

VO 

O rH 
rH 



51/78 



CM CO 

in ^ 



O O 
O 

H 3: 



H CO 

o 
< 

U 

El CO 

H 
Eh 

a J 



a 

H 

u 



o in 
vo in 



D1 
03 

4-> 

o» 

u 
cn 

4J 

u 
as 
01 
05 
nJ 
u 

OJ 

nj 
Di 

4J 
4-) 
4-> 
U 
OJ 

u 

U 

cn 

4-) 

cn 

4J 

S" 

cn 
CD 
u 
u 
«J 

4J 
OJ 

rO 
OJ 

cn 



























VO 


in 


CN 


in 


00 m 




in 


o 


in 


vo 


in 


fM in 


VO 


r- 




as 


i-f 


00 


ro 


cr> 






vo 


O 00 


rH 


f-H 


iH 




t-H CN 


iH 


CN 


iH 


CN 


H 


CN 


CN CN 


U 




O 






Eh 








O 




< 
























Eh 




Q 


O 


> 






>H 


o 








o > 










U 


o 








< 




O 










O 


u 








u 




Eh 






< 




u pc; 


H 


CO 


Eh 






Cu 












u 


6 




Eh 
Eh 




^' 














o 
















O 








o < 


Eh 


CO 


H 




Eh 


Ix, 




H 








u 
















H 










1 














U 




o 




< CO 




Pei 








M 


Sm 






Eh 












•J 


o 




C5 






Eh 




^ :^ 


I 








CD 




Eh 


O 


<< 


Eh 














Eh 


IS 


CD > 



























< M 

£h 
O 

< CO 

g 



Eh CO 

< 
Eh 

< 

O 
< 

H 



CD 
H 
CD > 

Eh 



Eh 

gi 

< 
H 

< M 

O 
CD 

CD O 
CJ 

Eh CO 
H 



Eh 



U 

< Eh 

t-< 
Eh 

o 
u 

H CO 

I. 
s 

CD W 

Eh 

CJ 

o 

< 
O 



H 
Eh 

H 



u 

% 

H 
Eh 
H 

CD 
O 
< 



I: 



Eh CO 
Eh 

< M 

u 
o 

< CO 



01 

4J 

4-> 
4-> 
4-> 
fU 
<TJ 
4-> 
U 

m 
fd 

4-) 
(0 

u 

4J 

u 
u 

4J 

CO 

4J 
4J 

cn 
u 

4J 

u 
m 

01 

CD Q 



O O 

O 
CD 

CD 
H 

o 
< 

CD > 

< Eh 



CD 
O 

H 

O Q 
O 

H O 



O 
H 

Eh J 



CJ CD* 

CD 
Eh 

O > 

CD < 
O 



00 in 
o o 
CN n 



< Eh 



CD W 
H 

Eh 
U 

U 

CD > 



u 

Eh 

CJ J 

g 



CJ 
CJ 

C-> I 

CD 
H 

o : 

H 
Eh 
CD 



^ in 

iH CN 

CN m 



Eh 

Eh 

o 

CD 

Eh CJ 

CJ 
Eh 

CD > 



Eh CO 

CJ J 

Eh 
CJ 

H CO 

Eh 
Eh 

CD 

CJ OcJ 
CJ K 



o in 



CD 
O 

O < 

H 
H 
Eh 

CD > 

u o 
u 

o 
u 

U ft 



CD 

3: 



Eh ^ 

U 
CJ 

Eh CO 
Eh 

rcC CO 

u 



CD Q 



vo in 
CN vo 
CN n 



O 

o 
u 

Eh 
CD 

C!) CD 

CD 
CD 

Eh is 

CJ 
H 

< M 



Eh 

CJ 

O ft 
H 

O > 

< 
CD 

U Cd 

Eh 

CD > 

Eh 

CJ J 



Eh 








CD 




CD 






O 




CD 




Eh 




Eh 


< 


CJ 




CJ 


CJ 




t 




Eh 








g 


o 








H 




Eh 


u 


Eh 




o 


H 


CO 


CD 




< 




g 


O 


< rd 


£h 




^ 




< 


M 






CJ J 


CD Q 


H CO 


H 








CJ 




H 






o 




Eh 




H 




< 


H 


< 


U 




H 








CJ 




H 






u 




Eh 








CD 




CD 




O 


CD 


> 


Eh 




CD 




CJ 


»^ 




H 


CO 


Eh 




Eh 


CO 


CD a 


< CO 


CJ OiJ 




CJ ^ 


CJ 




CD 








CJ 






Eh 




1 




CJ 




Eh 






U 


H 


H 




Eh 




i 










H 










Eh 






CJ 


H 


O 




Eh 






Q 








O 


> 






g 




H fa 




< Eh 


U J 


Eh 




Eh 




Eh 








< 






CD 




< 






< 




U 


H 






Eh 












CJ 










H 






CJ 


CJ 


< 


H 




Q 


CJ 


►J 


% 


CO 


% 




Eh CO 


CD 




:l 




Eh 




u o 


Eh CO 


CD < 


CD Q 


Eh fa 


o 


cri 


o 




CN 




CN 




vo 


(N 


vo 


CN 


VO 


00 


vo 


00 vo 


00 vo 


00 vo 


00 vo 


00 vo 




CN 


m 




O 


in 


VO 


r- 


CN cn 


00 


fH 




m 


o 




vo vo 


CN 00 


00 o 


CN 


o ^ 




rH 


in 


rH 


vo 




VO 




rH 




CN 


GO 


CN 


CA 


<N 


CTt OJ 


O CN 


o n 


rH rO 


CN ro 










iH 




tH 




i-t 


rH 








rH 




rH 


CN 


CN 


CM 


CN 



CO 



Q 
LU 
3 



U- O 
O 



+ 



+ 



52/78 '\ 





vo 


in 


m 


in 


in 




in 




in 


in 


vx> 










00 




m in 


m in 


in 


CM in 


00 in 




in 


o 


in 


in 


ro in 


o 




vo CM 


CN 


CN 


00 


rH 


m 


ro cy» 


^ o 


in (N 


in ^ 








CO 




00 iH 


as 


CM 


0> ^ 


o 


VO 


o 


GO 


CN n 






<>j ^ 


CN ^ 


CN 




CN 




CN ^ 


CM in 


CN 


in 


CN in 


ro 


in 


ro 


m 


Eh 






















nj 










4J 




O 


O 


< 


O 












a 


O 


4J 




o 






(U 




< E- 




o 


o 








% 






o 


4-) 




H 


§ 




(U 






o o 


o < 










pa 




< Eh 






o > 




pa 


4J 





1^ 



• g s 



s 

4^ 



:x3 a 



3 

O PQ 
U 

Eh 

O 
< 

cn 
nJ 
o 
u 

ro 
U 

4-> 
OJ 

cn 
u 
m 



u 

rd 
U 
O 
(0 

4J 
4J 

fO 

u 

4J 
4J 

nJ 
to 
CO 
CO 

4J 

CO 

■p 
O) 

CO 
4J 



Eh J 
< M 

Eh 

00 \s> 

vo vo 

CN ro 



Eh 3c 



I pa 



H 
H 

H Pu 

Eh 
Eh 

Eh Pu 

H 
U 

Eh CO 

% 

Eh 

< l-H 

H 
U 

< M 

< 

Eh 

< M 
Eh 

£h 

Eh Cli 

8 

Eh O) 

I. 

< 

o 
o 

u 
u 

Eh 

% 

O PP 

vo vo 

CN CN 

in 'St* 

CN 



3: 







0 


_ 

CH 






Eh 


fcH 


CJ 0 




<C M 








t_l 

CH 


_ 

fcH 




CJ 






<I Eh 


O Q 










_ 

CH 








5 .J 




CD Q 




0 pa 








CU 




fcH 




O 






P3 


CH WI4 






u 


EH 


u 


CD 


Eh 


CJ 




Eh {xi 










fcH 






Eh 






CD ^ 


Eh >h 


0 pa 


Eh 


u 


Eh 


Eh 






O J> 




_ 


Eh 




CJ 


CJ 






< Eh 












< 


0 


CO 






u 


vj OS 




4-> 






4J 




id! 


1 ^ 




3 


4J 




Eh CO 


CO 






iJ 




< 


CO 






CO 




< £h 


4J 






u 


< 


Eh 


cn 


H 


Eh 


as 




< M 


(0 






(0 




Eh 


ro 


% 


Eh 


(0 


O Q 




t; 






01 




U 


4-) 




a 


4-) 




CJ a* 


4J 






iJ 




< 


4J 






4-) 


Eh CO 


CJ cki 


03 






4-> 








% El) 




(0 




go 


CO 






CO 






4-) 


U 


Eh 


4J 


< Eh 


Eh 


4J 






4-> 


a 




OJ 






4-) 




CJ J 




VD VO 


VD VD 


VD VD 


00 ^ 


^ VO 


0 CO 


in ^ 


VO ^ 


P- ^ 


CN 


CN 


CN 



Eh 
O 

U J 
Eh 

CD 
CJ 

Eh 

O 
O 

< Eh 
Eh 

CD 

CD 

Eh 
O 

< 

O 

o ^ 

Eh 

< M 

O > 

CJ 
H 

a 

CJ O4 

CD 
Eh 

CJ 

< H 

VD VD 

x> ^ 

CM 



pa 



cn 

CD 

CD CD 

Eh 

H PL. 

CD 
CJ 

o 

6 a: 

CD 

Eh 
Eh 

u 

Eh 
H 

CJ J 
< 

CD Q 

! 

Eh 
H 

Eh Pu 

Eh 
CJ 

< H 

CJ J 

H 
Eh 

o 

Eh 

CD > 

00 ro 
.CM vo 

o in 
ro 



CO S 



CD 

u: 



o 
a 



+ 



+ 



53/78 







in 




in 




ro 




in 




in 




CM 




ro 




ro 




ro 








o 




o 




in 


rH 


rH 


iH 




rH 




ro 


o 




vo 


o\ 


ro 


CM 


o\ 


00 


in 


GO 


rH 










CO 


o 


00 


rH 






rH 


CM 


ro 


ro 






in 






in 




m 


o 


vo 










vo 


00 




a\ 




ro 


in 


ro 


VO 


ro 


\o 


ro 




ro 




ro 


vo 


ro 


vo 


ro 


r- 


ro 




ro 




ro 


r- 


ro 




ro 





'ass? 



< M 

< 

O Q 



o 
o 
u 

o 

Eh 



I: 
o 



a 

O Oi 

o 

u 

-p 



OS 
4J 
4-» 

n3 

4-) 

03 
U 
4J 

u 
u 

4J 
4-> 
03 
01 
u 
Ol 

u 

01 

4J 

m 

OS 
4J 



<ri CM 

CO CO 

o in 



H 
U 

< 

U Oi 

U 
< 

o a 

Eh 

< CO 

< Eh 

< 

U 

H 
< 

U 



m 
m 
u 

(TS 
03 
4J 
4-> 
4J 
XJ 
U 
4J 
OS 

m 

01 
01 

4J 
4J 

OJ 

u 
*J 

4-> 
4-> 
4J 
4-> 
4-> 
03 
4-> 
4-) 
AJ 
Ol 
OS 

m 

4J 

Ol 



H 
Eh 

H 
Eh 

u 

Eh CO 

H 
U 

Eh CO 

Eh 

< CO 



u 
u 

< Eh 

o 
o 



u 

03 
03 
4-) 
03 
03 
4-» 
01 
4-) 
4-> 
03 
03 
Ol 
4-> 
4-> 
01 
4-> 
4J 
03 
03 
U 
U 
03 

4-1 

03 
4-J 

Ol 

O 
H 



6 

H CO 

O 
O 

O O 

o o 

Eh 

o 

U 
H 

< M 

H 
O 

O O 

Eh 
Eh 

u > 







< 




Eh 


Du 


C5 > 






O > 






o 


















H 




O 


H 
























H 










>H 






>H 


< M 


























H 












u 












O 












u 


















Q 






£h 


CO 


o ca 




Q 


S PC 


O 




s 












H 














O 


1 


^: 


Eh 






CD 


> 


CD CD 










< 




o 










Eh 




o 






o 




U 




o 


Eh 




U 










u 




O 


< 








H CO 


Eh 
























O 
















o 


Eh 




a 




CO 






Eh 














Eh 






< 


M 


< 


M 


O 




o > 




Q 




















o 


Eh 


















o 




Eh 


H 










CJ 


a; 




> 


< 


CO 


< S 


o 


> 








Eh 




H 




Eh 














CD 










Eh 
















Q 


H 




O 


> 




Eh 


















Eh 




H 


U 






< 


M 










^1 




Eh 


H 










H 




O 


w 








Eh 






CJi 
















Ol 








m 




H 




u 




a 




03 


Eh 




u 


4-> 








u 








4J 


U 




Eh 


u 








u 








4J 


< 


Eh 


CJ ^ 


03 
















4J 








03 




a 












4J 








03 




u 




1 








4-) 








01 




H 


CO 




Q 






4J 




Q 


< 2 


4-> 
















U 








03 








o 




< 




4-> 


s 






0$ 
















4-> 








4-) 












< 




OS 




CO 




u 




VD 




vo 


<N 


vo 


(N 




VO 


o 


vo o 


m 


ro 


in 


CTi 


rH 


rH 




ro 




o 


vo 


vo CO 


ro 


o^ 


H 


in 


CM 


VO 




vo 


ro VO 


<^ 


vo 


vo 


in 


vo 


ro 




ro 




ro 




ro 


ro 




ro 


ro 





8 

Eh CO 

U 

CD W 

^ Q 

H 
H 

< M 

O 
H 

CD Q 

CJ 
Eh 
Eh 

•< 
CD 

tH CJ 

Eh J 

H 
Eh 

Eh tu 

a 

H CO 

Eh CO 

CD 
Eh 

H 
Eh 

< M 

a^ o 
m r« 
ro 



s 

Eh 
Eh 

O 
Eh 

U i-q 

CJ 
H 
Eh 

H 

CD a 
O 

CD Q 

O 
H 

CD > 
< 

CD > 

CJ ck: 

Eh J 

CD 
H 

Eh J 

CD 

O > 

< 
U 

Eh CO 

O CD 



CD 

a 

< H 

Eh 
Eh 

Eh Cju 

in CN 
vo r- 
ro 



CD CD 



Q 

CO UJ 

o 



+ 



54/78 



111 



ill 



O 

VO CO 
a\ r-i 
CO 00 



o 

< 
o 

o 

6 

< Eh 

< 

O Q 

H 

H 
O 

O 
O 



Pi 



H 
U 

o > 

U 

U 
H 

O 

O O 

a 

H 

£h Ex. 
O 

< 
O 



o 

O CTi 





<Tk 
















o% 






00 


00 


00 


00 


00 


CO 




00 


o 


00 




m 


m 




o in 


VX> V£> 


CM VO 


00 VO 


ro 


O 






VO 


CN 


CO 


CN 


o 


m 




^ ro 


^ St* 


in VO 


in 00 


GO 




00 


^ 


00 




00 






^ 




m 


CTl 


^ <r» 


















m 








U 










o 












CD 




u 




4J 




H 


< 






CO 


U 








(13 




o 


< 


01 




a 




O 






H 


CO 




« 


4J 








4J 




< Eh 




< 












u 




o 




01 














H 




< 




nJ 




Eh 




u 


U 


O 


< 




CD 








Eh 




nJ 




Eh 






Eh 


H 


O 


s 


nj 








Eh 




4J 










Eh 


Eh J 






U 












+J 




o 














4J 




U 




a 




4-> 










Eh 








fd 




Eh 




< 




4J 








o 


O 




o 


nc 


ffl 




H 








fU 










O O 


H CO 






Eh 
< 



< 

o 
o 

Eh O 
Eh 

Eh Ci. 

o 

fd 
u 

4J 

nJ 
u 
nj 
u 
n3 
Oi 

4-> 
4J 

cd 

4-» 

fd 
fd 
u 
Oi 
fd 
u 

4J 
4J 

o 
(d 
O) 
CO 
O) 

o 

4J 

o 

O) 

CN VO 

o n 



Eh 

H 

Eh 

O 
Eh 

U > 

O 
< 

< 
o 

o 

Eh 
H 

O > 
< 

< 

CD 

o 
o 

O P4 



o 

H 
Eh 

CJ J 

O Q 

CN VO 

in a\ 



in 

VO en 

VO 00 

fd 

4J 

fd 

4-» 

cn 
fd 
fd 

4J 
U 

fd 



fd 

4-» 



fd 
fd 

fd 
fd 
fd 

4J 

u 

4-> 

cn 

4J 

01 
01 

4-> 
U 

fd 



fd 

4J 



u 
u 

4J 

fd 
u 
fd 

4J 

fd 

4-) 
fXj 
4J 
*J 

fd 



fd 
fd 
o 

4J 
■M 

fd 
u 

4J 

Dl 

g. 

CD Q 

H 

O < 
cr» t> 

CO 00 

in <x\ 



+ 



55/78 



tnininmLnLniOLDLnLn'^j' 

^CMOOOVD^C^OCOVX)^ 



4J 03 m 

U U iJ 
4-» OJ u 

4J OJ o 

4J *J or 

0 4J u 
«3 t7> U 
O) 01 U 
4-) JJ Ol 
4J 01 4-» 
<T} 0} (d 

01 Di nJ 

4J Ol «3 
4J U (0 
4J OJ Ol 
Ol OJ 4-> 

dJ 

4J 4J (d 

4J 4J 4-) 

fTJ (0 

4-) U 4-) 

4J U U 

01 O 4-> 

U 0)4-> 

4J 4J 4J 

U OJ iJ 

4-) U Ol 

<0 oJ 0> 

OJ U 4J 
4J U Ol 
4J O (Q 

rd Ot fd 

rd 4-> (d 

(d u 01 

<d oJ 01 

4-> 4-> (d 

01 u 03 

03 0] 4J 

(d 01 03 

03 4J 03 
4J U Ol 
O) 01 03 

03 4J 4J 
4J 4J 4J 
Ol 4-) 

Ol 03 
4-> U 
01 4-» 
OJ 03 4-> 
01 U 4-) 
01 03 O 
03 01 U 
4-» 4J 4-> 
03 4J U 
4J 03 4-> 

0 01 03 
03 03 03 
4J OJ 4-> 
03 03 U 
U 01 Ol 

01 OJ 01 
rd oJ o3 
03 4-) Ol 

U 4J 4-> 
014J 03 
03 4J U 

03 U 03 

03 03 4J 

U 4J 03 

U 03 03 

U 4J 4-) 

U 4-) Ol 

0 U 03 
03 U 03 

01 4-> 03 
4-» 03 03 
V u u 

01 4J 4-» 



U 
4J 
4J 
4J 



4J U 03 4J 
4J 014J 03 

01 03 4-> 01 

4J 03 4J 01 

01 03 03 4J 

fd 03 u 4-) 

U 4J 4J 03 

U Ol U 4J 

U 01 01 4J 

03 4J 03 03 

4-) 4-) 4-) O 

(d 4-> fd u 
4J 4-> m 01 

03 03 4J 03 

OJ 03 U 03 

4J 01 03 4-> 
Ol U 4-> 4-> 

m u 03 01 

01 03 03 -U 
oJ 0> 01 Ol 

Ol4-> 4J 03 
OJ 014J 4-> 
OJ 01 4J 03 

03 -u 01 (d 
03 4-> 03 03 

01 4J 4J 01 

01 4J 03 03 

03 oJ 01 03 

OJ 03 01 OS 

01 03 03 oJ 

U 4J OJ 01 

0 U 4J 4J 
4J Ol U 01 
rd 01 03 Ol 
4J rd 03 01 

OJ O OJ *J 
U 03 01 03 

01 Ol 03 U 
03 4-) 03 01 
rd 03 03 4J 

4J Ol m 4J 

4-) 03 01 4-> 

rd U U 4J 
01 01 03 oJ 
01 4J 01 03 
01 U 03 U 

Cn 4-> 01 4-> 

Ol4-> 4-) 4-> 

03 U U U 

03 01 OJ 03 

U 03 4J 01 

03 03 U 4-) 

03 U 01 U 

OJ 03 4-) U 

4-> 03 4-) 4-> 

U Ol U 4J 

4J 01 U 4J 

4J 4-> 01 4J 

U 4-> Ol4-> 

4-» 4-> U 4-> 

OS OJ U 0> 

4J 03 Ol 4J 

4J 4J 01 4-» 

nJ 03 4-> 4J 

Ol Ol Ol 4J 

fd Ol Ol Ol 

03 03 4J 01 

01 03 03 nJ 

0 U U OJ 
rd 01 U 4-> 

01 03 4-> OJ 
03 OS OS 4J 
OJ OJ 4-> 03 
03 014J 4-) 

03 03 O) 4J 
03 OS 01 O 
U Ol OS 4-> 
D14J U 01 

4J 4J U 4J 

nj Ol U 03 



U U 03 
U Ol4-> u 

4J 4-> OS U 

U 4J rd OJ 
03 O 4J 

rd 4-> 4J 01 
rd 4-> 01 01 

03 4J 4-> OS 

o3 oJ rd OS 

U 03 01 OJ 

4-> 4-» 4-> O 

4J U 4J U 

U U U 03 

01 01 O 01 

01 03 4-) 4J 
OS 03 OS 01 
Ol 01 Ol 03 
4-) 03 Ol 03 

rd 01 O AJ 

4J 01 D14J 

4-) OJ rd O) 

0 01 u m 

014J u u 

03 U 4J Ol 
03 01 O O 

014J 4J 03 

4J O 03 4J 

4J 01 4J U 

03 OJ Ol 4-) 

4J U OJ u 

4-) 4-> 01 01 

U OS OS 03 

rd 4J 014-) 
rd 4J 01 01 

01 u o u 
03 01 01 4-> 

OS OS 4-» o 

03 4J 03 4-> 
4J OJ 014-) 
U 01 03 U 
4J 01 O 03 
Ol4-> 4-> Ol 

OS OJ OS u 

OS 03 OS 4J 
Ol 03 Ol 4J 
01 01 4J u 
U Ol OS Ol 
Ol OJ 01 u 

OS 01 4-» 4J 

Oj 4J 4J 01 

OS OJ U 4J 

U Ol4-» u 

4J U U OS 

U U 4-) 4J 

U OJ 03 u 

4J u u u 

4-> U 03 4J 

01 4J Ol Ol 
01 4J o3 01 
4-» U 014J 
4J 4J 01 OS 
U OS Ol4-> 
4J 0$ 4-> 01 
4-) 01 4-> 4-> 
4-> 01 03 OS 
03 OJ U OJ 
OS Ol 4-) 03 
03 01 03 Ol 

U OJ 4J 4-> 

U OJ 03 U 
01 OJ OS 4J 
OS 4J rd 4-> 

4-) 4J 0$ 4-» 
01 4J Ol OJ 

03 OS 03 4J 

OS 01 OS u 

t7l4-» 03 4-> 

U U 03 4-> 

U U 03 U 

4J 4J m rd 



CO 2 



CD 

u: 



o 
o 



v^f^ooo^cnorHrsjroro^ 
^^^^^ini/iLnuninin 



56/78 



1 

met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAG GTC GTC GAG CTG CTC 

10 20 
arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gin ser ile 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 
gly ile arg gin his leu lys arg val gin leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 

glu ala glu val arg gin his arg glu ala arg pro ala leu leu 
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC .AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 110 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160 170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gin 
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

180 

asp arg leu thr glu val ile ala ser ile ile lys pro gin asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 

190 200 

thr tyr cys val arg arg tyr ala val val gin lys ala ala met 
ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG 
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gly thr ser 
GGC ACG TCC 

220 

gin gly ile 
CAG GGG ATC 



leu cys tyr 
CTG TGC TAG 

250 

arg asp gly 
CGG GAC GGG 



thr pro his 
ACA CCT CAC 

280 

arg gly val 
CGA GGT GTC 



val val asn 
GTG GTG AAC 

310 

phe val gin 
TTT GTT CAG 



leu leu asp 
CTG CTG GAT 

340 

tyr ala arg 
TAT GCC CGG 



phe lys ala 
TTC AAG GCT 

370 

arg leu lys 
CGG CTG AAG 



leu gin thr 
CTC CAG ACG 

400 

ala tyr arg 
GCG TAC AGG 



210 
ala arg pro 
GCA AGG CCT 



pro gin gly 
CCG CAG GGC 

240 

gly asp met 
GGC GAC ATG 



leu leu leu 
CTG CTC CTG 

270 

leu thr his 
CTC ACC CAC 



pro glu tyr 
CCT GAG TAT 

300 

phe pro val 
TTC CCT GTA 



met pro ala 
ATG CCG GCC 

330 

thr arg thr 
ACC CGG ACC 



thr ser ile 
ACC TCC ATC 

360 

gly arg asn 
GGG AGG AAC 



cys his ser 
TGT CAC AGC 

390 

val cys thr 
GTG TGC ACC 



phe his ala 
TTT CAC GCA 



ser arg ala 
TCA AGA GCC 



ser ile leu 
TCC ATC CTC 



glu asn lys 
GAG AAC AAG 



arg leu val 
CGT TTG GTG 



ala lys thr 
GCG AAA ACC 



gly cys val 
GGC TGC GTG 



glu asp glu 
GAA GAC GAG 



his gly leu 
CAC GGC CTA 



leu glu val 
CTG GAG GTG 



arg ala ser 
AGA GCC AGT 



met arg arg 
ATG CGT CGC 



leu phe leu 
CTG TTT CTG 



asn ile tyr 
AAC ATC TAC 



cys val leu 
TGT GTG CTG 



thr ser tyr 
ACG TCC TAC 

230 
ser thr leu 
TCC ACG CTG 



leu phe ala 
CTG TTT GCG 

260 
asp asp phe 
GAT GAT TTC 



phe leu arg 
TTC CTC AGG 

290 
val asn leu 
GTG AAC TTG 



ala leu gly 
GCC CTG GGT 

320 
phe pro trp 
TTC CCC TGG 



gin ser asp 
CAG AGC GAC 

350 
leu thr phe 
CTC ACC TTC 



lys leu phe 
AAA CTC TTT 

380 
asp leu gin 
GAT TTG CAG 



lys ile leu 
AAG ATC CTC 

410 
gin leu pro 
CAG CTC CCA 



val gin cys* . 
GTC CAG TGC 



leu cys ser 
CTC TGC AGC 



gly ile arg 
GGG ATT CGG 



leu leu val 
TTG TTG GTG 



thr leu val 
ACC CTG GTC 



arg lys thr 
CGG AAG ACA 



gly thr ala 
GGC ACG GCT 



cys gly leu 
TGC GGC CTG 



tyr ser ser 
TAC TCC AGC 



asn arg gly 
AAC CGC GGC 



gly val leu 
GGG GTC TTG 



val asn ser 
GTG AAC AGC 



leu leu gin 
CTG CTG CAG 



phe his gin 
TTT CAT CAG 
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420 

gin val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 

arg leu pro leu leu leu his pro glu ser gin glu arg arg asp 
CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG GAT 

450 

val ala gly gly gin gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460 470 

arg ala val ala val pro pro ser ile pro ala gin ala asp ser 
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro cys his leu arg ala thr pro gly val thr gin asp ser 
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

510 

pro gly gly arg ser gin pro gly thr ala leu arg leu gin asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC <» 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 

CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 

AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 

CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 

TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 

AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 

GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 

TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Motif -1 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 



. LWSLIRCFFYVTEQQKSYSKT . 
. FIIPILQSFFYITESSDLRNRT . 
. LIPKI IQTFFYCTEISSTVTIV . 
. YWELLRSFFYVTETTFQKNRL . 
FFY TE 



Motif 0 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 



K 

p hhh K hR h R 

. KSLGFAPGKLRLIPKKT- -TFRPIMTFNKKIV 
. QKTTLPPAVIRLLPKKN- -TFRLITNLRKRFL 
. TLSNFNHSKMRI I PKKSNNEFRI I AI PCRGAD 
. ARPALLTSRLRFIPKPD- -GLRPIVNMDYWG 
R PK R I 



Motif A 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 



AF 

h hDh GY h 
. PKLFFATMDIEKCYDSVNREKLSTFLK. 
.RKKYFVRIDIKSCYDRIKQDLMFRIVK. 
. PELYFMKFDVKSCYDSIPRMECMRILK. 
. PELYFVKVDVTGAYDTIPQDRLTEVIA. 

F D YD 



// 



Motif B 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 



hPQG pS hh 
. NGKFYKQTKGI PQGLCVSSILSSFYYA . 
.GNSQYLQKVGIPQGSILSSFLCHFYME . 
. EDKCYIREDGLFQGSSLSAPIVDLVYD . 
. RATSYVQCQGIPQGSILSTLLCSLCYG . 
G QG S 



Motif C 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 



Y 

h F DD hhh 
. PNVNLLMRLTDDYLLITTQENN. . 
. KKGSVLLRWDDFLFITVNKKD . . 
. SQDTLILKLADDFLIISTDQQQ. . 
. RRDGLLLRLVDDFLLVTPHLTH . . 
DD L 



Motif D 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCPl 
consensus 



Gh h cK 
. NVSRENGFKFNMKKL . . 
. LNLSLRGFEKHNFST . . 
. KKLAMGGFQKYNAKA . . 
. LRTLVRGVPEYGCW . . 

G 
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pi 

Q 

i Fs 



1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
2351 
2401 
2451 



GCAGCGCTGC 
CCGCGATGCC 
AGCCACTACC 
GCCCCAGGGC 
CGNTGGTGGC 
CCCGCCGCCC 
CCGAGTGCTG 
TCGGCTTCGC 
ACCACCAGCG 
GGGGAGCGGG 
TGGTTCACCT 
TGCGCCTACC 
TCAGGCCCGG 
CAACGGGCCT 
CCAGCCCCGG 
GTTGCCCAAG 
CCGTTGGGCA 
GACCGTGGTT 
CTCTTTGGAG 
GCCGCCAGCA 
GGGACACGCC 
TCCTCAGGCG 
AGGCCCAGCC 
TTCCAGGCCT 
GCGNTACTGG 
CGCAGTGCCC 
GCGGTCACCC 
TGTGGCGGCC 
CTGCTCCGCC 
CTGCCTGCGC 
AACGCCGCTT 
GCCAAGCTCT 
CGCTTGGCTG 
ACCGTCTGCG 
GTGTACGTCG 
GTTTCAAAAG 
TGCAAAGCAT 
CTGTCGGAAG 
GACGTCCAGA 
TGAACATGGA 
GCCGAGCGTC 
CGAGCGGGCG 
ACGATATCCA 
GACCCGCCGC 
CGACACCATC 
AACCCCAGAA 
GCCCATGGGC 
AGACCTCCAG 
GCCCGCTGAG 
GCCAGCAGTG 



GTCCTGCTGC 
GCGCGCTCCC 
GCGAGGTGCT 
TGGCGGCTGG 
CCANTGCNTG 
CCTCCTTCCG 
CT^ANGCTGT 
GCTGCTGGAC 
TGCGCAGCTA 
GCGTGGGGGC 
GCTGGCACGC 
ANGTGTGCGG 
CCCCCGCCAC 
GGAACCATAG 
GTGCGAGGAG 
AGGCCCAGGC 
GGGGTCCTGG 
TCTGTGTGGT 
GGTGCGCTCT 
CCACGCGGGC 
TTGTCCCCCG 
ACAAGNACAC 
TGACTGGCGT 
TGGATGCCAG 
CAAATGCGGC 
CTACGGGGTG 
CAGCAGCCGG 
CCCGAGGAGG 
AGCACAGCAG 
CGGCTGGTGC 
CCTCAGGAAC 
CGCTGCAGGA 
CGCAGGAGCC 
TGAGGAGATC 
TCGAGCTGCT 
AACAGGCTCT 
TGGAATCAGA 
CAGAGGTCAG 
CTCCGCTTCA 
CTACGTCGTG 
TCACCTCGAG 
CGGCGCCCCG 
CAGGGCCTGG 
CTGAGCTGTA 
CCCCAGGACA 
CACGTACTGC 
ACGTCCGCAA 
CCGTACATGC 
GGATGCCGTC 
GCCTCTTCGA 



GCACGTGGGA 
CGCTGCCGAG 
GCCGCTGGCC 
TGCAGCGCGG 
GTGTGCGTGC 
CCAGGTGTCC 
GCGANCGCGG 
GGGGCCCGCG 
CCTGCCCAAC 
TGCTGCTGCG 
TGCGCGNTNT 
GCCGCCGCTG 
ACGCTANTGG 
CGTCAGGGAG 
GCGCGGGGGC 
GTGGCGCTGC 
GCCCACCCGG 
GTCACCTGCC 
CTGGCACGCG 
CCCCCATCCA 
GTGTACGCCG 
TGCGNCCCTC 
TCGGGAGGTT 
GATTCCCCGC 
CCCTGTTTCT 
TTCCTCAAGA 
TGTCTGTGCC 
AGGAACACAG 
CCCCTGGCAG 
CCCCAGGCCT 
ACCAAGAAGT 
GCTGACGTGG 
CAGGGGTTGG 
CTGGCCAAGT 
CAGGTCTTTC 
TTTTCTACCG 
CAGCACTTGA 
GCAGCATCGG 
TCCCCAAGCC 
GGAGCCAGAA 
GGTGAAGGCA 
GCCTCCTGGG 
CGCACCTTCG 
CTTTGTCAAG 
GGCTCACGGA 
GTGCGTCGGT 
GGCCTTCAAG 
GACAGTTCGT 
GTCATCGAGC 
CGTCTTCCTA 



AGCCCTGGCC 
CCGTGCGCTC 
ACGTTCGTGC 
GGACCCGGCG 
CCTGGGANGN 
TGCCTGAANG 
CGCGAANAAC 
GGGGCCCCCC 
ACGGTGACCG 
CCGCGTGGGC 
TTGTGCTGGT 
TACCAGCTCG 
ACCCGAANGC 
GCCGGGGTCC 
AGTGCCAGCC 
CCCTGAGCCG 
GCAGGACGCC 
AGACCCGCCG 
CCACTCCCAC 
CATCGCGGCC 
AGACCAAGCA 
CTTCCTACTC 
CGTGGAGACA 
AGGTTGCCCC 
GGAGCTGCTT 
CGCAGTGCCC 
CGGGAGAAGC 
ACCCCCGTCG 
GTGTACGGCT 
CTGGGGCTCC 
TCATCTCCCT 
AAGATGAGCG 
CTGTGTTCCG 
TCCTGCACTG 
TTTTATGTCA 
GAAGAGTGTC 
AGAGGGTGCA 
GAAGCCAGGC 
TGACGGGCTG 
CGTTCCGCAG 
CTGTTCAGCG 
CGCCTCTGTG 
TGCTGCGTGT 
GTGGATGTGA 
GGTCATCGCC 
ATGCCGTGGT 
AGCCACGTCT 
GGCTCACCTG 
AGAGCTCCTC 
CGCTTCATGT 



CCGGCCACCC 
CCTGCTGCGC 
GGCGCCTGGG 
GCTTTCCGCG 
ANGGCNGCCC 
ANCTGGTGGC 
GTGCTGGCCT 
CGAGGCCTTC 
ACGCACTGCG 
GACGACGTGC 
GGNTCCCAGC 
GCGCTGCNAC 
GTCTGGGATC 
CCCTGGGCTG 
GAAGTCTGCC 
GAGCGGACGC 
TGGACCGAGT 
AAGAAGCCAC 
CCATCCGTGG 
ACCACGTCCT 
CTTCCTCTAC 
AATATATCTG 
NTCTTTCTGG 
GCCTGCCCCA 
GGGAACCACG 
GCTGCGAGCT 
CCCAGGGCTC 
CCTGGTGCAG 
TCGTGCGGGC 
AGGCACAACG 
GGGGAAGCAT 
TGCGGGACTG 
GCCGCAGAGC 
GCTGATGAGT 
CGGAGACCAC 
TGGAGCAAGT 
GCTGCGGGAG 
CCGCCCTGCT 
CGGCCGATTG 
AGAT^AAGAGG 
TGCTCAACTA 
CTGGGCCTGG 
GCGGGCCCAG 
CGGGCGCGTA 
AGCATCATCA 
CCAGAAGGCC 
CTACCTTGAC 
CAGGANAACA 
CCTGAATGAG 
GCCACCACGC 



FIG. 50 




62/78 



2501 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 

2551 GCTCCATCCT CTCCACGCTG CTCTGCAGCC TGTGCTACGG CGACATGGAG 

2601 AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 

2651 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 

2701 TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 

2751 CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 

2 801 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 

: , 2 851 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC 

1^ 2 901 CGGACCTCCA TCAGAGCCAG TCTCACCTTC AACCGCGGCT TCAAGGCTGG 

2 951 GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 
Q 3 001 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCACCAAC 
i|i 3 051 ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT 
'%^' 3101 GCAGCTCCCA TTTCATCAGC AAGTTTGGAA GAACCCCACA TTTTTCCTGC 
':f: ' 3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT GAAAGCCAAG 
ry 32 01 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 

32 51 CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 

in 3301 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG 

3351 ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC 

'1, 34 01 CGCAGCCAAC CCGGCACTGC CCTCAGACTT CAAGACCATC CTGGACTGAT 

3451 GGCCACCCGC CCACAGCCAG GCCGAGAGCA GACACCAGCA GCCCTGTCAC 

:C 3501 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 

1^ 3551 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 

3601 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 

=2 3651 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 

y 3701 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 

iU 3751 CTCCCCACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC 

3 8 01 GCCCTGCCCT CCTTTGCCTT CCACCCCCAC CATCCAGGTG GAGACCCTGA 
3851 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG 
3901 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTCAAATT 
3951 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT 
4001 TTGAAAAAAA AAAAAAAAAA AAAAAAAAA 
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GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCX5GCCACCCCCGCGATGCC ' -^^^ ' 

1 + + _,. + + + 60 

CGTCGCGACGCAGGACGACGCGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGG 

a AALRPAAHVGSPGPGHPRDA 

b QRCVLLRTWEALAPATPAMP- 

C SAASCCARGKPWPRPPPRCR- 

GCGCGCTCCCa3CTGCCX3AGCCX3TGCGCTCCCTGCTGCGCAGCCACrACCGCGAGGTGC^ 

61 + + + + + -+ 120 

CGCX3CGAGGGGCGACGGCTCGGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGA 

a ARSPLPSRALPAAQPLPRGA 

b RAPRCRAVRSLLRSHYREVL- 

C ALPAAEPCAPCCAATTARCC- 

GCCX3CTGGCCACGTTCGTGCGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGG 

121 + + + + + + 180 

CGGa3ACCGGTGC7^GCACGCCGCGGACCCaX3GGTCCCGACCGCa3ACCACGTCGra 

i'^" a AAGHVRAAPGAPGLAAGAAR 

i3 b PLATFVRRLGPQGWRLVQRG- 

C RWPRSCGAWGPRAGGWCSAG- 

y1 GGACCaX3CGGCTTTCCX3CX3a3NTGGTGGCCCANTGCOTGGTC 

g 181 + + + + + + 240 

i« 5 CCTGGGCCGCCGAAAGGCGCGCNACCACCXKSGTNACGmCCACACGCACGGGACC 

: y 

=S a GPGGFPR?GGP??GVRALG? 

\n b DPAAFRA?VA?C?VCVPW??- 

c TRRLSARWWP?AWCACPG??- 

Q ANGGaSTGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGTCCTGCCTGAANGANCTGGTGGC 

241 + + + + + + 300 

TNCCGNCGGGGGGCX3GCGGGGGAGG7^GGCGGTCCACAGGACGGACTTNCTNGACCACCG 

a ? AAPRRPLLPPGVLPE??GG 

;ir b ??PPAAPSFRQVSCL??LVA- 

C G?PPPPPPSARCPA*??WWP- 

ill 

CCGAGTGCTGCANANGCTGTGCGANCGCGGCGCGAANAACGTGCTGGCCTTCGGCTTCGC 

301 + + + + + + 360 

GGCTCACGACGTNTNCGACACGCTNGCGCCGCGCTTNTTGCACGACCGGAAGCCGAAGCG 

a PSAA?AVR?RRE?RAGLRLR 

b RVL??LC?RGA?NVLAFGFA- 

C ECC??CA?AAR?TCWPSASR- 

GCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTA 

361 + + + + + --+ 420 

CGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCGGAAGTGGTGGTCGCACGCGTCGAT 

a AAGRGPRGPPRGLHHQRAQIi 

b LLDGARGGPPEAFTTSVRSY- 

C CWTGPAGAPPRPSPPACAAT- 

CCTGCCCAACACGGTGACCXSACGCACTGaSGGGGAGCGGGGCGTGGGGGCTGCTGC 

421 + + + + + + 480 

GGACGGGTTGTGCCACTGGCTGOSTGACGCCCCCTCGCCCCXSCACCCCCGACGACGACGC 

a PAQHGDRRTAGERGVGAAAA 

b LPNTVTDALRGSGAWGLLLR- 

c CPTR*PTHCGGAGRGGCCCA- 
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a PRGRRRAGSPAGTLR??CAG - 

b RVGDDVLVHLLARCA?FVLV-'w 

C AWATTCWFTCWHAAR?LCWW- 

GGOTCCCAGCTXSCGCCTACa^GTGTGCXSGGCaSCaSC^ 

541 + + + + + + 600 

CO^AGGGTCGACGCGGATGGTNCACACGCCCGGCGGCGACTITGGTCXSAGCCGCXSACGN^ 

a GSQLRLP?VRAAAVPARRC? 

b ?PSCAY?VCGPPLyQLGAAT- 

C ?PAAPT?CAGRRCTSSAL ?L- 

TCAGGCCCGGCCCCCGCCACACGCTANTGGACCCGAANGCGTCTGGGATCCT^CGGGCCT 

601 + + + + + + 660 

AGTCCGGGCCGGGGGCGGTGTGCGATNACCTGGGCTTNCGCAGACCCTAGGTTGCCCGGA 

a SGPAPATR?WTR?RLGSNGP 

b QARPPPHA?GPE?VWDPTGL- 

C RPGPRHTL?DP?ASGIQRAW- 

jassls 

;«« GGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCTGCCAGCCCCGGGTGCGAGGAG 

y 661 + + + + + + 720 

jjj CCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGACCCGACGGTCGGGGCCCACGCTCCTC 

't! a GTIASGRPGSPWAASPGCEE 

b EP*RQGGRGPPGLPAPGARR- 
ilj C NHSVREAGVPLGCQPRVRGG- 

GCGa3GGGGCAGTGCCAGCCX3AAGTCTGCCGTTGCCCAAGAGGCCCAGGCX3TGGCGCTGC 
yi 721 + + + + + + 780 

;s cgcxscccccgto^cggtcggcttcagaosgcaacgggttctccgggtccgcaccgcgara 

'•^ a argqcqpksavaqeaqawrc 

£ b rggsasrslplpkrprrgaa- 

iIl C agavpaevcrcprgpgvalp- 

ccctgagccxsgagcggacgcccgttgggcaggggtcctoggcccacccgggcaggacgc 

781 + + + + + + 840 

I?; GGGACTa3GCCTCGCCTGCGGGCAACCa3TCCCa\GGACCa3GGTGGGCCCGT^ 

a P *AGADARWAGVLGPPGQDA 

b PEPERTPVGQGSWAHPGRTP- 

c LSRSGRPLGRGPGPTRAGRL- 

TGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCAC 

841 + + + + + + 900 

ACCIXSGCTCACTGGCACCAAAGACACACCACAGTGGACXSGTCTGGGCGGCrrT 

a WTE*PWFLCGVTCQTRRRSH 

b GPSDRGFCVVSPARPAEEAT- 

C DRVTVVSVWCHLPDPPKKPP- 

CTCITTGGAGGGTGCGCTCrCTGGCACGCGCCACTCCCACCCy^TCCXSTC^ 

901 + + + + + + 960 

GAGAAACCTCCCACGCGAGAGACCGTGCGCGGTGAGGGTGGGTAGGCACCCGGCGGTCGT 

a LFGGCALWHAPLPPI RGPPA 

b SLEGALSGTRHSHPSVGRQH- 

C LWRVRSLARATPTHPWAAST- 

CCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCTGGGACACGCCTTGTCCCCCG 

961 _ + + + + + + 1020 

GGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGGTGCAGGACCCTGTGCGGAACAGGGGGC 
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PRGPPIHIAATTSWDTPCPP 
HAGPPSTSRPPRPGTRLVPR- 
TRAPHPHRGHHVLGHALS PG- 

GTGTACXSCCXSAGACCAAGCACTTCCrCTACTCCrCAGGCGAC^ 

1021 + + + + + + 1080 

CACATGCGGCTCTGGTTraTGAAGGAGATGAGGAGTCCGCTGTTCa^ 

VYAETKHFLYSSGDK?TA?L 
CTPRPSTSSTPQAT?TLRPS- 
VRRDQALPLLLRRQ?HC? PP- 

CTTCCTACTCAATATATCTGAGGCCCAGCCTGACTGGCGTTCGGGAGGTTCGTGGAGACA 

1081 + + + + + + 1140 

GAAGGATGAGTTATATAGACTCCGGGTCGGACTGACCGCT^GCCCTCCAAGCACCTCTGT 

LPTQYI*GPA*LAFGRFVET 
FLLNISEAQPDWRSGGSWR? - 
SYS lYLRPSLTGVREVRGD?- 

NTCTTTCTGGTTCCAGGCCTTGGATGCCAGGATTCCCCGCAGGTTGCCCCGCCTGCCCCA 

1141 + + + + + + 1200 

NAGAAAGACCAAGGTCCGGAACCTACGGTCCTAAGGGGCGTCCAACGGGGCGGACGGGGT 

7FLVPGLGCQDSPQVAPPAP 
SFWFQALDARIPRRLPRLPQ- 
LSGSRPWMPGFPAGCPACPS- 

GaOTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGT^ 

1201 + + + + + + 1260 

CX3CNATGACa3TTTACGCCGGGGACAAAGACCTCX3ACGAACX:CT^ 

A?LANAAPVSGAAWEPRAVP 
RYWQMR PLFLELLGNHAQ CP - 
?TGKCGPCFWSCLGTTRSAP- 

CTACXX3GGTGTTCCTCAAGACGCACK3CCCGCTGa3AGCTGC^ 

1261 + + + + + + 1320 

GATGCCCCACT^GGAGTTCrcCXSTGACXKXSaSACGCTC^ 

LRGVPQDALPAASCGHPSSR 
YGVFLKTHCPLRAAVTPAAG- 
TGCSSRRTARCELRSPQQPV' 

TGTCTGTGCCCXX3GAGAAGCCCCAGGGCTCTGTGGCGGCCCCa3AGGAGGAGGAAC7^m 

1321 + + - + + + + 1380 

ACAGACACGGGCCCTCITCGGGGTCCCGAGACACCX3CCGGGGGCTCCTCCTCCT 

CLCPGEAPGLCGGPRGGGTQ 
VCAREKPQGSVAAPEEEEHR- 
SVPGRSPRALWRPPRRRNTD- 

ACCCCCGTCXSCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTG^ 

1381 + + + + + + 1440 

TGGGGGCAGOSGACCACGTCGACXSAGGCGGTCXSTGTCXSTCGGGGACCXS 

TPVAWCSCSASTAAPGRCTA 
PPSPGAAAPPAQQPLAGVRL- 
PRRLVQLLRQHSS PWQVYGF- 

TCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACG 

1441 + + + + + + 1500 

AGCACGCCCXSGACGGACGCGGCCGACCACGGGGGTCCGGAGACCCCGAGGTCCGTGTTGC 
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SCGPACAGWCPQASGAPGTT 
RAGLPAPAGAPRPLGLQAQR- 
VRACLRRLVPPGLWGSRHNE- 

AACXKTCGCTTCCTCAGGAACACCy^GAAGTTCATCTCCCrGGGG^ 

1501 + + + + + + 1560 

TTGCGGCXSAAGGAGTCCITGTGGTTCTTCAAGTAGAGGGACCCC^ 

NAASSGTPRSSSPWGSMPSS 
TPLPQEHQEVHLPGEACQAL - 
RRFLRNTKKFISLGKHAKLS- 

CGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCC 

1561 + + + + + + 1620 

GCGACGTCCTCGACTGCACCTTCTACTCGCACGCCCTGACGCGAACCGACGCGTCCTCGG 

RCRS*RGR*ACGTALGCAGA 
AAGADVEDERAGLRLAAQEP - 
LQELTWKMSVRDCAWLRRS P- 

CAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGT 

1621 + + + + + + 1680 

GTCCCCAACCGACACAAGGCCGGCGTCTCXSTGGCAGACGCACTCCTCTAGGACCGGTTC^ 

QGLAVFRPQSTVCVRRSWPS 
RGWLCSGRRAPSA*GDPGQV- 
GVGCVPAAEHRLREEILAKF- 

TCCraCACTGGCTGATGAGTGTGTACGTCXSTaSAGCIGCT 

1681 + + + + + + 1740 

AGGACXSTGACCXSACTACTOVCACATGCAGCAGCTaaACGAGTCCAG?^ 

SCTG**VCTSSSCSGLSFMS 
PALADECVRRRAAQVFLLCH - 
LHWLMSVYVVELLRSFFYVT- 

CGGAGACCACGTTTO^AAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCT 

1741 + + + + + + 1800 

GCCTCTGGTGCAAAGTTTTCITGTCCGAGAAAAAGAT^ 

RRPRFKRTGSFSTGRVSGAS 
GDHVSKEQALFLPEECLEQV- 
ETTFQKNRLF.FYRKSVWSKL- 

TGCAAAGCATTGGAATGAGAOVGCACITGAAGAGGGTGC^ 

1801 + + + + + + I860 

ACXSTTTCGTAACCTTAGTCTGTOSTGAACITCTCCCACGTCXSAra 

CKALESDST**RGCSCGSCRK 
AKHWNQTALEEGAAAGAVGS - 
QSIGIRQHLKRVQLRELSEA- 

CAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCA 

1861 + + + + + -f 1920 

GTCTCCAGTCCGTCGTAGCCCTTCGGTCCGGGCGGGACGACTGCAGGTCTGAGGCGAAGT 

QRSGSIGKPGPPC*RPDSAS 
RGQAASGSQARPADVQTPLH- 
EVRQHREARPALLTSRLRFI- 

TCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAA 

1921 + + + + + + 1980 

AGGGGTTCGGACTGCCCGACGCCGGCTAACACTTGTACCTGATGCAGCACCCTCGGTC 
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SPSLTGCGRL*TWTTSWEPE 
PQA*RAAADCEHGLRRGSQN- 
PKPDGLRPIVNMDYVVGART- 

CGTTCaSCAGAGAAAAGAGGGCCGAGOSTCTCACCTCGAGGGTGAAGGCACTO 

1981 + + + + + + 2040 

GO^GGCGTCTCTTTTCrrCCCGGCTCGCAGAGTGGAGCTCCCACT^ 

RSAEKRGPSVSPRG*RHCSA 
VPQRKEGRASHLEGEGTVQR- 
FRREKRAERLTSRVKALFSV- 

TGCrCAACTACXSAGCGGGCGCGGOSCCCCGGCCTCCTGGGOT 

2041 + + + + + + 2100 

ACGAGTTGATGCTCGCCCGCGCCX3CGGGGCCGGAGGACCCGCGGAGACACX3ACCCGGACC 

CSTTSGRGAPASWAPLCWAW 
AQLRAGAAPRPPGRLCAGPG- 
LNYERARRPGLLGASVLGLD- 

ACGATATCCACAGGGCCTGGCGCACCTTaBTGCTGCGTGTGCGGGCCCAGGACCCGCCGC 

2101 + + + + + + 2160 

TGCTATAGGTGTCCCGGACCGCGTGGAAGCACGACGCACACGCCCGGGTCCTGGGCGGCG 

TISTGPGAPSCCVCGPRT. RR 
RYPQGLAHLRAACAGPGPAA- 
DIHRAWRTFVLRVRAQDPPP- 

CTGAGCTGTACITTGTCAAGGTGGATGTGACGGGCXSCGTAC^ 

2161 + + + + + + 2220 

GACTCGACATGAAACAGTTCCACOTACACn^CCCXSOGCATGCTGT^ 

LSCTLSRWM*RARTTPSPRT 
*AVLCQGGCDGRVRHHPPGQ- 
ELYFVKVDVTGAYDT I PQDR- 

GGCTCACGGAGGTCATCXSCCAGCATCATCAAACCCCAGAACACGTACTGCGTGra 

2221 + -+ + + + + 2280 

CCGAGTGCCTCCAGTAGCXXSTCGTAGTAGTTTGGGGTCTTGTGCATGACGC^^ 

GSRRSSPASSNPRTRTACVG 
AHGGHRQHHQTPEHVLRASV- 
LTEVIAS I IKPQNTYCVRRY- 

ATGCCX3TGGTCCAGAAGGCCGCCCATGGGCACX3TCCX3CAAGGCCTTCAA 

2281 + + + + + + 2340 

TACXSGCACCAGGTCITCCGGCGGGTACCaSTGCAGGCXSTTCC^ 

MPWSRRPPMGTSARPSRATS 
CRGPEGRPWARPQGLQEPRL- 
AVVQKAAHGHVRKAF KSHVS- 

CTACCTTGACAGACCTCCAGCCGTAGATGCGACAGTTCGTGGCrCACCTGCAGG;^ 

2341 + + + + + + 2400 

GATGGAACTGTCTCGAGGTCXKSCATGTACXSCTGTCAAGm 

LP*QTSSRTCDSSWLTCR-?T 
YLDRPPAVHATVRGSPAG7Q- 
TLTDLQPYMRQFVAHLQ7NS- 

GCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTG 

2401 + + + + + + 2460 

CGGGCGACTCCCTACGGCAGCAGTAGCTCGTCTCX3AGGAGGGACTTACTCCGGTCGTCAC 
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a AR*GMPSSSSRAPP*MRPAV 

b PAEGCRRHRAELLPE*GQQW- 

C PLRDAVVIEQSSSLNEASSG- 

GCCTCTTCXSACXSTCnTCCTACXSCnTCATGTGCCACCAC^ 

2461 + + + + + + 2520 

CGGAGAAGCnXSCAGAAGGATGCGAAGTACACGGTGGTGaSGCAra 

a ASSTSSYASCATTPCASGAS 

b PLRRLPTLHVPPRRAHQGQV- 

C LFDVFLRFMCHHAVRIRGKS- 

CCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCTVTCCTCrCC^ 

2521 + + + + + + 2580 

GGATGCAGGTCACXSGTCCCCTAGGGCGTCCCGAGGTAGGAGAGGTGCGACGAGACXSTCGG 

a PTSSARGSRRAPSSPRCSAA 

b LRPVPGDPAGLHPLHAALQP- 

C YVQCQGIPQGSILSTLLCSL- 

TGTGCTACGGCGACATGGAGAACAAGCTGTTTGa3GGGATTCGGCGG(^ 

2581 + + + + + + 2640 

I*---; ACACGATGCCGCTGTACCTCTTGTTCGACAT^CGCCCCTAAGCCGCCCTGCCCGACGAGG 

'3 a CATATWRTSCLRGFGGTGCS 

ijl b VL RRHGEQAVCGDSAGRAAP- 

C CYGDMENKLFAGIRRDGLLL- 

flf TGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCrCACCC^ 

r- 2641 + + - + + + + 2700 

AOSCAAACCACXrrACTAAAGAACAACCACrGTGGAGTGGAGTGGG 

y I 

is a CVWWMISCW*HLTSPTRKPS- 

b AFGG*FLVGDTSPHPRENLP' 

^ C RLVDDFLLVTPHLTHAKTFL- 

-3" 

TCAGGACCCI^TCCX3AGGTGTCCCTGAGTATGGCTGa3TGGTGAACTTC 

i . 2701 + + + + + + 2760 

^ AGTCCTGGGACCAGGCTCCACAGGGACTCATACa3ACX3CACCACTTGAAOT 

jfi a SGPWSEVSLSMAAW*TCGRQ- 

b QDPGPRCP*VWLRGELAEDS- 
c RTLVRGVPEYGCVVNLRKTV- 

TGGTGAACTTCCCTGTAGAAGACX3AGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGG 
2761 + + + + + + 2820 

accacttgaagggacatcitctgctccxsggacccaccgtgccgaa;^ 

a W*TSL*KTRPWVARLLFRCR 

b gelpcrrrgpgwhgfcsdag- 
C vnfpvedealggtafvqmpa- 

CCCACGGCCTATTCCCCnXSGTGaSGCCTGCnXSCro 

2821 + + + + + + 2880 

GGGTGCCGGATAAGGGGACCACGCCXSGAaSAraACCrATGGGCCTGGGAC^ 

a PTAYSPGAACCWIPGPWRCR 

b PRPIPLVRPAAGYPDPGGAE- 

C HGLFPWCGLLLDTRTLEVQS- 

GCXSACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCT 

2881 + + + + + + 2940 

CGCTGATGAGGTCXSATAOSGGCCrGGAGGTAGTCTCGGTCAGAGTGGAAGTTGGCXSCC 
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ATTPAMPGPPSEPVSPSTAA 
RLLQLCPDLHQSQSHLQPRL- 
DYSSYARTSIRASLTFNRGF- 

TCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCrrG;^ 

2941 + + + + + + 3000 

AGTTCOSACCCTCCTTGTACXSCAGCGTTTGAGAAACCCCAGAACGCCGACTTm 

SRLGGTCVANSLGSCG*SVT 
QGWEEHASQTLWGLAAEVSQ- 
KAGRNMRRKLFGVLRLKCHS- 

GCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCT^CATCTAC^ 

3001 + + + + + + 3060 

CGGACAAAGACCTAAACGTCCACTTGTCGGAGGTCTGCCACACGTGGTTGTAGATGTTCT 

ACFWICR*TASRRCAPTSTR 
PVSGFAGEQPPDGVHQHLQD - 
LFLDLQVNSLQTVCTNI YKI- 

TCCTCCTOCTGCAGGCGTACAGGTTTCAOGCATGTGTGCTGC^ 

3061 + + + + + + 3120 

AGGAGGACGACXSTCCGCATGTCCAAAGTGCGTACACACGACXSTCGAGGGTAAAGTAGTOT 

SSCCRRTGFTHVCCSSHFIS 
PPAAGVQVSRMCAAAPISSA- 
LLLQAYRFHACVLQLPFHQQ- 

AAGTTTGGAAGAACCCCT^CATTTTTCCTOCGCXSTCATCTCTGA^ 

3121 + + + + + + 3180 

TTCAAACCTTCTTGGGGTGTAAAAAGGACGOSCAGTAGAGACTGTGCCGGAG^ 

KFGRTPHFSCASSLTRPPSA 
SLEEPHIFPARHL*HGLPLL- 
VWKNPTFFLRVISDTASLCY- 

ACTCCATCCTGAAAGCCT^GAAOSCAGGGATGTaSCTGGGGGCa^G^ 

3181 + + + + + + 3240 

TGAGGTAGGACTTTCGGTTCITGCGTCCCTACAGCGACCCCCXSGTTC 

TPS*KPRTQGCRWGPRAPPA 
LHPESQERRDVAGGQGRRRP- 
S I LKAKNAGMS LGAKGAAGP- 

CTCTGCCCTCCXSAGGCCGTGCAGTGGCTGTGCCACCT^GCATTCCT^^ 

3241 + + + + + + 3300 

GAGACGGGAGGCTCCGGCACX3TCACCGACACGGTGGTTCX3TAAGGACGAGTTOT 

LCPPRPCSGCATKHSCSS*L 
SALRGRAVAVPPSIPAQADS- 
LPSEAVQWLCHQAFLLKLTR- 

GACT^CCXSTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGCAGCT 

3301 + + + + + + 3360 

CTGTGGCACAGTGGATGCACX3GTGAGGACCCCAGTGAGTCCTGTCGGGTCTGCGTCGACT 

DTVSPTCHSWGHSGQPRRS* 
TPCHLRATPGVTQDSPDAAE- 
HRVTYVPLLGSLRTAQTQLS- 

GTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGGCACTGC 

3361 + + + + + + 3420 

CAGCCTTCGAGGGCCCCTGCTGCGACTGACGGGACCTCCX3GCGTCGGTTGGGCCGTGACG 
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a VGSSRGRR*LPWRPQPTRHC 

b SEAPGDDADCPGGRSQPGTA- 

c RKLPGTTLTALEAAANPALP- 

CCrCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACA 

3421 + + + + + + 3480 

GGAGTCTGAAGTTCraGTAGGACCTGACTACCGGTGGGCGGGTGTCGGTCCGGCTCTCGT 

a PQTSRPSWTDGHPPTARPRA 

b LRLQDHPGLMATRPQPGREQ- 

C SDFKTILD*WPPAHSQAESR- 

GACACCAGCAGCCCTGTCACGCaSGGCTCrACGTCCCAC^ 

3481 + + + + + + 3540 

CTGTGGTCGTCGGGACAGTGCGGCCCGAGATGCAGGGTCCCTCCCTCCCCGCCGGGTGTG 

a DTSSPVTPGSTSQGGRGGPH 

b TPAALSRRALRPREGGAAHT- 

C HQQPCHAGLYVPGREGRPTP- 

J . CCAGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGT 

3541 + + + + + + 3600 

ij= GGTCCGGGOSTGGCGACCCTCAGACTCCGGACTCACTCACAAACCGGCTCCGGACGTAC^ 

i'^! a PGPHRWESEA*VSVWPRPAC- 

lli b QARTAGSLRPE*VFGRGLHV- 

C RPAPLGV*GLSECLAEACMS- 

' ^ CCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTG 

CI 3601 + -- - + + + + + 3660 

i p GGCCGACTTCCGACTCACAGGCCGACrCCXSGACTCGCTCACAGGTaS^ 

a PAEG*VSG*GLSECPAKG*V- 
rn b RLKAECPAEA*ASVQPRAEC- 

c G*RLSVRLRPERVSSQGLSV- 

TCCAGCACACCTGCOSTCTTCACrTCCCCACAGGCTGGaSCT 

3661 + + + + + + 3720 

AGGTCGTGTGGACGGCAGAAGTGAAGGGGTGTCCGACCGCGAGCCGAGGTGGGGTCCCGG 

11] a SSTPAVFTSPQAGARLHPRA 

b PAHLPSSLPHRLALGSTPGP- 
c QHTCRLHFPTGWRSAPPQGQ- 

AGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGA 

3721 + + + + + + 3780 

TCGAAAAGGAGTGGTCCTCGGGCCX3AAGGTGAGGGGTGTATCCTTATCAGGTAGGGGTCT 

a SFSSPGARLPLPT*E*SIPR 

b AFPHQEPGFHSPHRNSPSPD- 

C LFLTRSPASTPHIGIVHPQI- 

TTCGCOVTTGTTCACCCCrCGCCCrroCCCTCCTTTGCCOT 

3781 + + + + + + 3840 

AAGCXSGTAACAAGTGGGGAGCGGGACXXXSAGGAAAC^^ 

a FAIVHPSPCPPLPSTPTIQV 

b SPLFTPRPALLCLPPPPSRW- 

C RHCSPLALPSFAFHPHHPGG- 

GAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTG 

3841 + + + + + + 3900 

CTCTGGGACTCTTCCTGGGACCCTCGAGACCCTTAAACCTCACTGGTTTCCACACGGGAC 



FIG. 51 

(CONTINUED) 



+ 



+ 



71/78 



a ETLRRTLGALGIWSDQRCAL 

b RP*EGPWELWEFGVTKGVPC- 

c DPEKDPGSSGNLE*PKVCPV- 

TACACAGGCXSAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCT^T^^ 

3901 + + + + + + 3960 

ATGTGTCCGCTCCTGGGACGTGGACCTACCCCCAGGGACACCCAGTTTAACCCCCCTCCA 

Q a YTGEDPAPGWGSLWVKLGGG 

b TQARTLHLDGGPCGSNWGEV- 
"v: C HRRGPCTWMGVPVGQIGGRC- 

ilj GCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAA 

3961 + + + + + + 4020 

CGACACCCTCATTTTATGACTTATATACTCAAAAAGTCAAAACTTTTTTTTTTTTTTTTT 

a AVGVKY*IYEFFSFEKKKKK 
b LWE*NTEYMSFSVLKKKKKK- 
C CGSKILNI*VFQF*KKKKKK- 

AAAAAAAAA 

4021 4029 

TTTTTTTTT 



a K K K 

b K K 

c K K 
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1 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAG CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 
phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 
leu lys glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 

130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 

140 150 
trp gly leu leu leu arg arg val gly asp asp val leu val his 
TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 

160 

leu leu ala arg cys ala leu phe val leu val ala pro ser cys 
CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 

170 180 
ala tyr gin val cys gly pro pro leu tyr gin leu gly ala ala 
GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC 

190 

thr gin ala arg pro pro pro his ala ser gly pro arg arg arg 
ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 
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200 210 
leu gly cys glu arg ala trp asn his ser val arg glu ala gly 
CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG GCC GGG 

220 

val pro leu gly leu pro ala pro gly ala arg arg arg gly gly 
GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC 

230 240 
ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly 
AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 

250 

ala ala pro glu pro glu arg thr pro val gly gin gly ser trp 
GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG 

260 270 
ala his pro gly arg thr arg gly pro ser asp arg gly phe cys 
GCC CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT 

280 

val val ser pro ala arg pro ala glu glu ala thr ser leu glu 
GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG 

290 300 
gly ala leu ser gly thr arg his ser his pro ser val gly arg 
GGT GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC 

310 

gin his his ala gly pro pro ser thr ser arg pro pro arg pro 
CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC 

320 330 
trp asp thr pro cys pro pro val tyr ala glu thr lys his phe 
TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC 

340 

leu tyr ser ser gly asp lys glu gin leu arg pro ser phe leu 
CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA 

350 360 
leu ser ser leu arg pro ser leu thr gly ala arg arg leu val 
CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC GCT CGG AGG CTC GTG 

370 

glu thr ile phe leu gly ser arg pro trp met pro gly thr pro 
GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 

380 390 
arg arg leu pro arg leu pro gin arg tyr trp gin met arg pro 
CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC 

400 

leu phe leu glu leu leu gly asn his ala gin cys pro tyr gly 
CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG 

410 420 
val leu leu lys thr his cys pro leu arg ala ala val thr pro 
GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA 
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ala 
GCA 


ala 
GCC 


gly 
GGT 


val 
GTC 


cys 
TGT 


ala 
GCC 


pro 
CCC 


glu 

GAG 


440 
glu 
GAG 


glu 
GAG 


leu 
CTC 


arg 
CGC 


gin 
CAG 


his 
CAC 


ser 
AGC 


ala 
GCC 


cys 
TGC 


leu 
CTG 


470 
arg 
CGC 


arg 
CGG 


his 
CAC 


asn 
AAC 


glu 
GAA 


arg 
CGC 


arg 
CGC 


leu 
CTG 


gly 
GGG 


lys 
AAG 


500 
his 
CAT 


ala 
GCC 


met 
ATG 


ser 
AGC 


val 
GTG 


arg 
CGG 


asp 
GAC 


gly 
GGC 


cys 
TGT 


val 
GTT 


530 
pro 
CCG 


ala 
GCC 


ala 
GCC 


lys 
AAG 


phe 
TTC 


leu 
CTG 


his 
CAC 


leu 
CTC 


arg 
AGG 


ser 
tCT 


560 
phe 
TTC 


phe 
TTT 


arg 
AGG 


leu 
CTC 


phe 
TTT 


phe 
TTC 


tyr 
TAC 


ile 
ATT 


gly 
GGA 


ile 
ATC 


590 
arg 
AGA 


gin 
CAG 


ser 
TCG 


glu 
GAA 


ala 
GCA 


glu 
GAG 


val 
GTC 


leu 
CTG 


thr 
ACG 


ser 
TCC 


620 
arg 
AGA 


leu 
CTC 


pro 
CCG 


ile 
ATT 


val 
GTG 


asn 
AAC 


met 
ATG 
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430 



ala 
GCC 


arg 
CGG 


glu 
GAG 


lys 
AAG 


pro 
CCC 


asp 
GAC 


thr 
ACA 


asp 
GAC 


pro 
CCC 


arg 
CGT 


ser 
AGC 


pro 
CCC 


trp 
TGG 


460 
gin 
CAG 


val 
GTG 


leu 
CTG 


val 
GTG 


pro 
CCC 


pro 
CCA 


gly 
GGC 


phe 
TTC 


leu 
CTC 


arg 
AGG 


490 
asn 
AAC 


thr 
ACC 


lys 
AAG 


leu 

etc 


ser 
TCG 


leu 
CTG 


gin 
CAG 


cys 
TGC 


ala 
GCT 


trp 
TGG 


520 
leu 
CTG 


arg 
CGC 


ala 
GCA 


glu 

GAG 


his 
CAC 


arg 
CGT 


leu 
CTG 


trp 
TGG 


leu 
CTG 


met 
ATG 


550 
ser 
AGT 


val 
GTG 


tyr 
TAT 


val 
GTC 


thr 
ACG 


glu 

GAG 


thr 
ACC 


arg 
CGG 


lys 
AAG 


ser 
AGT 


580 
val 
GTC 


trp 
TGG 


his 
CAC 

arg 
AGG 


leu 
TTG 

gin 
CAG 


lys 
AAG 

his 
CAT 


arg 
AGG 
610 
arg 
CGG 


val 
GTG 

glu 
GAA 


arg 
CGC 


phe 
TTC 


ile 
ATC 


pro 
CCC 


lys 
AAG 


asp 
GAC 


tyr 
TAC 


val 
GTC 


640 
val 
GTG 


gly 
GGA 
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gin gly ser val ala 
CAG GGC TCT GTG GCG 

450 

arg leu val gin leu 
CGC CTG GTG CAG CTG 



tyr gly phe val arg 
TAC GGC TTC GTG CGG 

480 

leu trp gly ser arg 
CTC TGG GGC TCC AGG 



lys lys phe ile ser 
AAG AAG TTC ATC TCC 

510 

glu leu thr trp lys 
GAG CTG ACG TGG AAG 



arg ser pro gly val 
AGG AGC CCA GGG GTT 

540 

arg glu glu ile leu 
CGT GAG GAG ATC CTG 



tyr val val glu leu 
TAC GTC GTC GAG CTG 

570 

thr phe gin lys asn 
ACG TTT CAA AAG AAC 



ser lys leu gin ser 
AGC AAG TTG CAA AGC 

600 

gin leu arg glu leu 
CAG CTG CGG GAG CTG 

ala arg pro ala leu 
GCC AGG CCC GCC CTG 

630 

pro asp gly leu arg 
CCT GAC GGG CTG CGG 



ala arg thr phe arg 
GCC AGA ACG TTC CGC 



arg glu lys 
AGA GAA AAG 



phe ser val 
TTC AGC GTG 



gly ala ser 
GGC GCC TCT 



thr phe val 
ACC TTC GTG 



tyr phe val 
TAG TTT GTC 



gin asp arg 
GAG GAG AGG 



asn thr tyr 
AAC ACG TAG 



his gly his 
CAT GGG CAC 



thr asp leu 
ACA GAC GTC 



glu thr ser 
GAG ACC AGC 



ser leu asn 
TCC CTG AAT 



phe met cys 
TTC ATG TGC 



gin cys gin 
CAG TGC GAG 



cys ser leu 
TGC AGC CTG 



ile arg arg 
ATT GGG CGG 



150 

arg ala glu 
AGG GCC GAG 



leu asn tyr 
GTC AAC TAC 

680 

val leu gly 
GTG CTG GGC 



leu arg val 
CTG CGT GTG 

710 

lys val asp 
AAG GTG GAT 



leu thr glu 
CTG ACG GAG 

740 

cys val arg 
TGC GTG CGT 



val arg lys 
GTC GGC AAG 

770 

gin pro tyr 
CAG CGG TAC 



pro leu arg 
CGG CTG AGG 

800 

glu ala ser 
GAG GCC AGC 



his his ala 
CAC CAC GCC 

830 

gly ile pro 
GGG ATG CGG 



cys tyr gly 
TGC TAC GGC 

860 

asp gly leu 
GAC GGG CTG 
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arg leu thr 
CGT GTC ACC 

670 

glu arg ala 
GAG CGG GCG 



leu asp asp 
CTG GAC GAT 

700 

arg ala gin 
CGG GCC CAG 



val thr gly 
GTG ACG GGC 

730 

val ile ala 
GTC ATC GCC 



arg tyr ala 
CGG TAT GCC 

760 

ala phe lys 
GCC TTC AAG 



met arg gin 
ATG CGA CAG 

790 

asp ala val 
GAT GCC GTC 



ser gly leu 
AGT GGC CTC 

820 

val arg ile 
GTG CGC ATC 



gin gly ser 
CAG GGC TCC 

850 

asp met glu 
GAC ATG GAG 



leu leu arg 
CTC CTG CGT 



ser arg val 
TCG AGG GTG 



arg arg pro 
CGG CGC CCC 



ile his arg 
ATC CAC AGG 



asp pro pro 
GAC CGG CGG 



ala tyr asp 
GCG TAC GAC 



ser ile ile 
AGC ATC ATC 



val val gin 
GTG GTC CAG 



ser his val 
AGC CAC GTC 



phe val ala 
TTC GTG GCT 



val ile glu 
GTC ATC GAG 



phe asp val 
TTC GAC GTC 



arg gly lys 
AGG GGC AAG 



ile leu ser 
ATC CTC TCC 



asn lys leu 
AAC AAG CTG 



leu val asp 
TTG GTG GAT 



660 
lys ala leu 
AAG GCA CTG 



gly leu leu 
GGC CTC CTG 

690 
ala trp arg 
GCC TGG CGC 



pro glu leu 
CCT GAG CTG 

720 
thr ile pro 
ACC ATC CCC 



lys pro gin 
AAA CCC CAG 

750 
lys ala ala 
AAG GCC GCC 



ser thr leu 
TCT ACC TTG 

780 

his leu gin 
CAC CTG CAG 



gin ser ser 
CAG AGC TCC 

810 

phe leu arg 
TTC CTA CGC 



ser tyr val 
TCC TAC GTC 

840 
thr leu leu 
ACG CTG CTC 



phe ala gly 
TTT GCG GGG 

870 
asp phe leu 
GAT TTC TTG 
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880 



leu 
TTG 


val 
GTG 


thr 
ACA 


pro 
CCT 


his 
CAC 


leu 
CTC 


thr 
ACC 


his 
CAC 


ala 
GCG 


lys 
AAA 


thr 
ACC 


phe 
TTC 


leu 
CTC 


arg 
AGG 


thr 
ACC 


leu 
CTG 


val 
GTC 


arg 
CGA 


890 
gly 
GGT 


val 
GTC 


pro 
CCT 


glu 
GAG 


tyr 
TAT 


gly 
GGC 


cys 
TGC 


val 
GTG 


val 
GTG 


asn 
AAC 


900 
leu 
TTG 


arg 
CGG 


lys 
AAG 


thr 
ACA 


val 
GTG 


val 
GTG 


asn 
AAC 


phe 
TTC 


pro 
CCT 


val 
GTA 


910 

glu 

GAA 


asp 
GAC 


glu 

GAG 


ala 
GCC 


leu 
CTG 


gly gly 
GGT GGC 


thr 
ACG 

gly 
GGC 


ala 
GCT 

leu 
CTG 


phe 
TTT 

leu 
CTG 


Q O n 

val 
GTT 

leu 
CTG 


gin 
CAG 

asp 
GAT 


met 
ATG 

thr 
ACC 


pro 
CCG 

arg 
CGG 


ala 
GCC 

thr 
ACC 


his 
CAC 
940 
leu 
CTG 


gly 
GGC 

glu 
GAG 


leu 
CTA 

val 
GTG 


phe 
TTC 

gin 
CAG 


pro 
CCC 

ser 
AGC 


930 
trp 
TGG 

asp 
GAC 


cys 
TGC 

tyr 
TAC 


ser 
TCC 


ser 
AGC 


tyr 
TAT 


950 
ala 
GCC 


arg 
CGG 


thr 
ACC 


ser 
TCC 


ile 
ATC 


arg 
AGA 


ala 
GCC 


ser 
AGT 


leu 
CTC 


thr 
ACC 


960 
phe 
TTC 


asn 
AAC 


arg 
CGC 


gly 
GGC 


phe 
TTC 


lys 
AAG 


ala 
GCT 


gly 
GGG 


arg 
AGG 


asn 
T^C 


970 
met 
ATG 


arg 
CGT 


arg 
CGC 


lys 
AAA 


leu 
CTC 


phe 
TTT 


gly 
GGG 


val 
GTC 


leu 
TTG 


arg 
CGG 


980 
leu 
CTG 


lys 
AAG 


cys 
TGT 


his 
CAC 


ser 
AGC 


leu 
CTG 


phe 
TTT 


leu 
CTG 


asp 
GAT 


leu 
TTG 


990 
gin 
CAG 


val 
GTG 


asn 
AAC 


ser 
AGC 


leu 
CTC 


gin 
CAG 


thr 
ACG 


val 
GTG 


cys 
TGC 


thr 
ACC 


1000 
asn ile 
AAC ATC 


tyr 
TAC 


lys 
AAG 


ile 
ATC 


leu 
CTC 


leu 
CTG 


leu 
CTG 


gin 

CAG 


ala 
GCG 


1010 
tyr arg 
TAC AGG 


phe 
TTT 


his 
CAC 


ala 
GCA 


cys 
TGT 


val 
GTG 


leu 
CTG 


gin 
CAG 


leu 
CTC 


1020 
pro phe 
CCA TTT 


his 
CAT 


gin 
CAG 


gin 
CAA 


val 
GTT 


trp 
TGG 


lys 
AAG 


asn 
AAC 


pro 
CCC 


1030 
thr phe 
ACA TTT 


phe 
TTC 


leu 
CTG 


arg 
CGC 


val 
GTC 


ile 
ATC 


ser 
TCT 


asp 
GAC 


thr 
ACG 


1040 
ala ser 
GCC TCC 


leu 
CTC 


cys 
TGC 


tyr 
TAC 


ser 
TCC 


ile 
ATC 


leu 
CTG 


lys 
AAA 


ala 
GCC 


1050 
lys asn 
AAG AAC 


ala 
GCA 


gly 
GGG 


met 
ATG 


ser 
TCG 


leu 
CTG 


gly 
GGG 


ala 
GCC 


lys 
AAG 


1060 
gly ala 
GGC GCC 


ala 
GCC 


gly 
GGC 


pro 
CCT 


leu 
CTG 


pro 
CCC 


ser 
TCC 


glu 

GAG 


ala 
GCC 


1070 
val gin 
GTG CAG 


trp 
TGG 


leu 
CTG 


cys 
TGC 


his 
CAC 


gin 

CAA 


ala 
GCA 


phe 
TTC 


leu 
CTG 


1080 
leu lys 
CTC AAG 


leu 
CTG 


thr 
ACT 


arg 
CGA 


his 
CAC 


arg 
CGT 


val 
GTC 


thr 
ACC 


tyr 

TAC 


1090 
val pro 
GTG CCA 


leu 
CTC 


leu 
CTG 


gly 
GGG 


ser 
TCA 


leu 
CTC 
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1100 1110 
arg thr ala gin thr gin leu ser arg lys leu pro gly thr thr 
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 

1120 

leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp 
CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 

1130 1132 
phe lys thr ile leu asp OP 

TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA 

CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 

AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 

GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 

CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 

CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 

CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 

CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC 

TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGRAAAAAAAAAAAAAAAAAA 

AAAAAAAT^AA 
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